GONCHAR-STAHL’S $\rho^2$-THEOREM AND ASSOCIATED
DIRECTIONS IN THE THEORY OF RATIONAL 
APPROXIMATION OF ANALYTIC FUNCTIONS 


E. A. RAKHMANOV 

Abstract. Gonchar-Stahl’s $\rho^2$-theorem characterizes the rate of
convergence of best uniform (Chebyshev) rational approximations (with free
poles) for one basic class of analytic functions. The theorem itself, its
modifications and generalizations, the methods involved in the proof, and
other related details constitute an important subfield in the theory of
rational approximations of analytic functions and complex analysis.

The paper briefly outlines essentials of the subfield. Fundamental 
contributions by A. A. Gonchar and H. Stahl are in the center of the 
exposition. 
1. Introduction. Statement of the theorem

The Gonchar-Stahl theorem characterizes the rate of convergence of best
uniform rational approximations (with free poles) for one basic class of
analytic functions. Its proof combines constructions and methods from
different branches of classical analysis and approximation theory. This
variety of significant connections explains the fundamental role of the
theorem in approximation theory and complex analysis. Some of those facts
and connections are briefly discussed below.

1.1. Walsh theorem. One of the main predecessors of the Gonchar-Stahl
theorem is the well-known theorem of J. L. Walsh [63] from the 1930s on the
approximation of an arbitrary element $f$ of an analytic function on a
continuum $E$ of the (extended) complex plane (see also the book [64] and
[11]).

We consider the distance from $f$ to the class of rational functions of
order $n$ in the uniform metric on $E$,

$$\rho_n(f) = \rho_n(f, E) = \min_{r \in \mathcal{R}_n} \max_{z \in E} |f(z) - r(z)|, \tag{1}$$

where $\mathcal{R}_n$ is the set of all rational functions $r_n = P_n/Q_n$
of order $\le n$ ($P_n, Q_n \in \mathbb{P}_n$ are polynomials of degree
$\le n$). It was well known that $\rho_n(f) \to 0$ as $n \to \infty$. The
problem was to determine the rate of convergence. Further, it was known
that the rate is at least geometric, so the problem was to estimate
(asymptotically) the ratio of the associated geometric progression.

The assumption that $f$ is analytic on $E$ means that there exists a domain
$\Omega$ containing $E$ such that $f \in H(\Omega)$; that is, $f$ is
holomorphic (analytic and single-valued) in $\Omega$.

Walsh’s theorem asserts that $f \in H(\Omega)$ implies that

$$\limsup_{n \to \infty} \rho_n(f)^{1/n} \le \rho = e^{-1/C(E,F)}, \tag{2}$$

where $F = \partial\Omega$ is the boundary of $\Omega$ and $C(E, F)$ is the
capacity of the condenser $(E, F)$. We will assume further that the
continuum $E$ has connected complement. If in addition $\Omega$ is simply
connected, then the number $1/\rho = 1/\rho(E, F)$ is also known as the
modulus of the ring domain $\Omega \setminus E$ (there is a conformal
mapping of $\Omega \setminus E$ onto $\{\rho < |z| < 1\}$).
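
The simplest instance of Walsh's estimate (2) can be checked by hand: for two concentric disks the ring domain is already an annulus. The following minimal sketch assumes $E = \{|z| \le r\}$ and $F = \{|z| = R\}$ with $0 < r < R$, and the normalization of the condenser capacity in which $\rho = e^{-1/C(E,F)}$; the function names are hypothetical and chosen only for illustration.

```python
import math

def condenser_capacity_concentric(r, R):
    """Capacity C(E, F) for E = {|z| <= r}, F = {|z| = R}, 0 < r < R,
    in the normalization where rho = exp(-1 / C(E, F))."""
    if not 0 < r < R:
        raise ValueError("need 0 < r < R")
    return 1.0 / math.log(R / r)

def walsh_rho(r, R):
    """Walsh constant rho = exp(-1 / C(E, F)); for concentric disks it is r / R."""
    return math.exp(-1.0 / condenser_capacity_concentric(r, R))

# Illustrative values only: inner radius 1, outer radius 4.
print(walsh_rho(1.0, 4.0))   # approximately r / R = 0.25
```

In this case the bound in (2) is the familiar geometric rate $r/R$ of Taylor polynomials, which is consistent with the remark that rational approximations need not be essentially better than polynomial ones for a general function of the class.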

In 1959 V. D. Erohin [9] presented examples proving that this estimate is
sharp; that is, it cannot be improved without further restrictions on $f$.
In particular, he constructed a function $f \in H(\Omega)$, where
$\Omega = \{z : |z| < R\}$ is a disk, such that for approximations of $f$
on a smaller disk $E = D_r = \{z : |z| \le r\}$ equality in (2) holds (it
follows that for this function $f$ rational approximations are not
essentially better than polynomial approximations of the same degree). This
construction may be modified to prove that for any domain $\Omega$ and any
continuum $E \subset \Omega$ there exists $f \in H(\Omega)$ such that
equality in (2) holds (for further details see the book [64] by Walsh and a
review by S. N. Mergelyan included in the Russian translation of this book
[66] concerning the progress in this direction made approximately between
1930 and 1960).
We note that in all those examples there is only a sparse subsequence of
natural numbers along which equality in (2) is reached. It was determined
later that “on average” the rate of convergence is essentially better
than in (2) for any function $f \in H(\Omega)$; see Section 1.2 below.

This means that the whole class $H(\Omega)$ always contains functions with
significantly irregular behavior of the sequence of rational approximations. 
In this context Gonchar-Stahl’s theorem essentially asserts that the behavior 
of rational approximations is necessarily regular for functions which have 
unlimited analytic continuation outside a set of singularities of zero capacity. 
To formally state the theorem we need the following definitions. 

1.2. Gonchar-Stahl $\rho^2$-theorem. Let $f|_E$ be an element of an
analytic function that we want to approximate. If the analytic continuation
of this element outside of $E$ has branch points, then there exist
different (maximal) domains $\Omega$ where $f|_E$ has a holomorphic
extension. Any of those domains may be used in combination with the Walsh
inequality (2) and, therefore, $\rho$ in this inequality may be replaced by
the following constant $\rho(f)$, called the extremal modulus associated
with the holomorphic function (analytic element) $f \in H(E)$:

$$\rho(f) = \rho(f, E) = \inf\left\{ e^{-1/C(E,F)} : F = \partial\Omega,\ f \in H(\Omega),\ E \subset \Omega \subset \overline{\mathbb{C}} \right\}. \tag{3}$$

There also exists a unique extremal domain $\Omega$, which satisfies the
condition $\rho(f, E) = \exp\{-1/C(E, \partial\Omega)\}$ and is maximal
among all domains with this condition.

Let $\mathcal{A}(\mathcal{D})$ be the class of all analytic elements in a
domain $\mathcal{D} \subset \overline{\mathbb{C}}$ which admit analytic
continuation along any path in this domain. We note that this is one of the
basic classes of analytic functions. For instance, solutions of
differential equations with coefficients in $H(\mathcal{D})$ belong to
$\mathcal{A}(\mathcal{D})$ (things are similar for all natural classes of
equations).

The following is the Gonchar-Stahl $\rho^2$-theorem.

Theorem 1. Let $\mathcal{D}$ be a domain in the extended plane such that
$\operatorname{cap}(\overline{\mathbb{C}} \setminus \mathcal{D}) = 0$. Let
$E \subset \mathcal{D}$ be a continuum with connected complement and
$f \in H(E) \cap \mathcal{A}(\mathcal{D})$. Then

$$\lim_{n \to \infty} \rho_n(f, E)^{1/n} = \rho(f, E)^2. \tag{4}$$

In particular, the limit on the left-hand side exists.

This is not the most general form of the theorem; the conditions on E may 
be essentially relaxed, but for the purposes of our discussion this version is 
representative enough. In the sequel we consider mostly cases when E is 
an interval or a disk (including the degenerate or local case when E is a
point). 

There is also Gonchar’s earlier version [17] of the theorem related to
Markov-type functions. The Markov case is simpler, but it is not a
particular case of Theorem 1 above (see also [19]). This case is discussed
in some detail separately in Section 2.

The proof of the general version of the theorem was presented in papers
by H. Stahl [54] and by A. A. Gonchar and the author [29]. In [54] the upper
estimate in (4) was proven and in [29] the corresponding lower estimate was
obtained. Actually, a general method of rational approximation was
developed in those two papers, and solutions of several longstanding
problems in approximation theory were obtained there as immediate
applications of the method. The $\rho^2$-theorem was one of the classical
applications of the method. Stahl’s paper [54] contains another classical
result; namely, Stahl’s celebrated theorem on the convergence of diagonal
Padé approximants for functions with branch points (corresponding to the
degenerate case $E = \{\infty\}$ in the $\rho^2$-theorem). The so-called
“1/9 problem” on best rational approximation of the exponential function on
a semi-axis was solved in [29] (it was obtained as a corollary of a general
theorem on Chebyshev approximation of a sequence of analytic functions).
Many other problems were investigated later by the same method or its
modifications (see, for example, [8], [31]). Further applications and
generalizations are forthcoming (some of them are discussed below in this
paper). For details and other references see the recent reviews [31], [5],
[49], and also JAT [32], [57].

Our purpose is to present the essentials of the method mentioned above. 
It is not possible to cover all the significant details. So, we have selected 
some of them for the discussion. To this end we use the $\rho^2$-theorem and its
version for Markov-type functions as a starting point of this discussion. It 
is also used to create a general context in the first part of the paper. 

1.3. Contents of the paper. In Section 2 we outline the proof of
Gonchar’s $\rho^2$-theorem for Markov-type functions (Theorem 2 below). The
proof of this theorem [17] is essentially simpler than the proof of
Theorem 1. The simplicity of this situation makes it possible to briefly
discuss the main components of the proof in some detail. In this connection
we also mention Stahl’s theorem on the rate of rational approximation of
$|x|$ on $[-1, 1]$ and results and conjectures by Gonchar on the problem of
characterizing classes of analytic functions by the rate of their rational
approximation.

In Section 3 we discuss some details related to Stahl’s theorem on the 
convergence of Padé approximants for functions with branch points and, in
particular, the characterization of minimal compact sets for functions with 
a finite number of branch points. This is a good ground for illustrating the 
geometric component of the method. Some new results are then presented 
on stability of convergence in Stahl’s theorem. Finally some new conjectures 
are presented on asymptotics of complex orthogonal polynomials (related to 
problems of convergence of Hermite-Padé approximations).

In the rest of this introduction we make some comments on the main
components of the method of proof of the $\rho^2$-theorem. Hopefully this
brief review conveys to some extent the essence of the method. In the
conclusion we also make a few more comments on its connections and
applications.

1.4. Brief description of the method. Now, we describe briefly the main 
components of the method including, in particular, a construction of “near 
best” approximations. 

1.4.1. Interpolation by rational functions with free poles. We begin by
selecting a triangular table of interpolation nodes on $E$, whose $n$-th row
contains $2n + 1$ points, and find the corresponding rational function of
order $n$ interpolating $f$ at the selected nodes. Now we need to select the
nodes so as to obtain a “near best” approximation.

We note that the proof of Walsh’s theorem was based on interpolation
with fixed poles. In our “free pole approximations” method, the poles are
not known in advance and have to be determined from the interpolation
table. To decide which nodes are “near optimal” (see [31]) we need precise
information on the location of the poles of the interpolating functions.

The fundamental fact is that the denominators of the approximations
(polynomials whose roots are poles of the approximations) satisfy certain
complex (non-Hermitian) orthogonality relations. Information on the poles
has to be derived from those orthogonality relations, and we come to a
problem of asymptotics (in a typical case, the weak-* zero distribution)
for a sequence of complex orthogonal polynomials.


1.4.2. Asymptotics of complex orthogonal polynomials. The method for
studying the zero distribution of complex orthogonal polynomials, based on
ingenious potential-theoretic arguments, was created by H. Stahl in [53],
[54] and then substantially generalized in [29]. The generalized Stahl
method (the so-called GRS-method) reduces the problem of asymptotics of
orthogonal polynomials to an equilibrium problem for the logarithmic
potential.

This equilibrium problem is essentially different from “standard”
equilibrium problems of complex analysis related to minimization of an
energy functional in the class of measures on a compact “conductor”.
Robin’s measure of a compact set in the plane and the equilibrium (signed)
measure (distribution) of a condenser (a pair of disjoint compact sets) are
classical examples. The proof of Walsh’s theorem was based on the condenser
equilibrium distribution.

In the case of complex orthogonal polynomials we encounter a different 
kind of equilibrium which may be defined as equilibrium in a conducting 
domain. 


1.4.3. Equilibrium in a conducting domain. This is a class of problems which 
may be described as problems of critical points of an energy functional. 
Any such critical point is naturally interpreted as an equilibrium position 
of an “electric charge” on an open “conductor”; the associated equilibrium 
is always unstable. A classical example of such problems is Chebotarev’s 
problem, which asks for a continuum of minimal logarithmic capacity in the 
class of all continua containing a given set of points. The Robin measure 
of Chebotarev’s continuum is an equilibrium distribution in the conducting 
plane with a finite number of insulating points. 

In particular, the proof of the $\rho^2$-theorem is related to a
generalization of Chebotarev’s problem (for the Green potential). A local
version of the problem (Padé approximants) is related to a generalization
of the standard Chebotarev problem. The solution of Chebotarev-type
problems may normally be described in terms of trajectories of a quadratic
differential associated with the problem. In many cases there are
equivalent reformulations in terms of moduli of families of curves, so that
this component of the method belongs also to geometric function theory or
even to differential geometry.

In [50]–[54] H. Stahl observed that the potentials of Chebotarev-type
systems of curves satisfy a certain symmetry property (now called the
$S$-property, i.e., the equality of the normal derivatives of the
equilibrium potential in two opposite directions) and that this property
may be directly used to study complex orthogonal polynomials. He
generalized the problem to include curves with the $S$-property
($S$-curves) related to extremal cuts for Padé approximants of functions
with branch points.

In [29] $S$-curves with an external field were part of the method. In the
general situation, the existence of an $S$-curve in a given class of
functions may be the key part of the whole problem (see [49]).

1.4.4. Lower bounds for approximations. The first three parts of the method
produce an “optimal” sequence of rational approximations to $f$. As an
immediate corollary, this gives the upper bound in (4). The corresponding
lower bound is obtained in [29] using special properties of the constructed
sequence of optimal approximations. The argument used is rather general
and may be stated as a separate theorem. The idea of the method was
contained in Gonchar’s earlier papers [10], [12]; see also the review [31].

1.4.5. Applications and connections. The method outlined above in
connection with the $\rho^2$-theorem potentially has a larger circle of
applications. Two components of the proof are especially important for
applications: the asymptotics of complex orthogonal polynomials and the
related $S$-equilibrium problems.

Orthogonal polynomials are the key to a great variety of applications.
Together with the traditional ones in approximation theory, numerical
analysis, and spectral theory, many new applications were found in the last
two or three decades, in particular in mathematical physics.

New classes of problems related to random matrices and statistics, among
other fields, enter the theory. New methods have been created in the theory
of orthogonal polynomials, in particular, steepest descent for matrix
Riemann-Hilbert problems. New versions of old methods such as the
Liouville-Green-Steklov method (a.k.a. WKB) were developed. In all these
cases an $S$-equilibrium configuration, representing a geometric component
of the problem, plays a key role.

The geometric component originating in the existence problem for
$S$-equilibrium configurations leads to an environment somewhat similar to
the one related to general moduli problems in geometric function theory
(moduli of families of curves, quadratic differentials, critical
trajectories). This part is often present in difficult problems. For
instance, many important questions on matrix $S$-problems related to the
study of Hermite-Padé polynomials are open; see [2], [49].

Finally, equilibrium problems create a certain connection to integrable
systems (soliton theory). In many cases such problems related to
approximations are similar to those that come from mathematical physics.
For instance, some equilibrium problems associated with the
$\rho^2$-theorem and its generalizations are surprisingly close to problems
that originated in the study of the KdV or NLS equations by means of the
inverse scattering transform method. For some further comments see [49].

2. Approximation of Markov-type functions 

Markov-type functions $f(z)$, which we call M-functions in the sequel, are
Cauchy transforms of positive measures $\sigma$ with compact support on the
real line,

$$f(z) := \int_F \frac{d\sigma(t)}{t - z}, \qquad z \in \Omega = \overline{\mathbb{C}} \setminus F, \tag{5}$$

where $F$ is a (finite) interval (we may assume that $F$ is the minimal
interval containing the support of $\sigma$).

In particular, an M-function $f$ belongs to
$\mathcal{A}(\overline{\mathbb{C}} \setminus e)$, $e \subset \mathbb{R}$, if
$e$ is finite and the jumps of $f$ across the branch cuts on the real line
have constant argument and are integrable. Note that the branch cuts along
$\mathbb{R}$ constitute the boundary of the associated extremal domain.
Thus, the classes of M-functions and
$\mathcal{A}(\overline{\mathbb{C}} \setminus e)$ with
$\operatorname{cap} e = 0$ overlap, but neither of them contains the other.

Now we fix an interval $E$ of the real line not intersecting $F$ and
consider best rational approximations to $f$ on $E$. Let $\rho_n(f, E)$ be
the distance from $f$ to the class $\mathcal{R}_n$ in the uniform norm on
$E$ (see (1)).

2.1. Gonchar’s $\rho^2$-theorem for Markov functions. The following is
Gonchar’s version of the $\rho^2$-theorem for M-functions.

Theorem 2. Let $\sigma'(x) = d\sigma/dx > 0$ almost everywhere on $F$. Then

$$\lim_{n \to \infty} \rho_n(f, E)^{1/n} = \rho(f, E)^2. \tag{6}$$

Main components of the proof. 

2.1.1. Interpolation. We begin with an arbitrary triangular table of points
$\{\zeta_{k,n}\} \subset E$, where $n$ is a natural number and
$k = 1, 2, \dots, 2n$ for a fixed $n$. We define
$W_n(z) = \prod_{k=1}^{2n} (z - \zeta_{k,n})$.

Next we define the $n$-th order multipoint Padé approximant $r_n$ to $f$
associated with the interpolation table $\{\zeta_{k,n}\}$ (see, for example,
[15], [16], [21]). For technical reasons it is convenient to use exactly
$2n$ interpolation points and to impose on the interpolating function the
condition $r_n(\infty) = 0$.

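For each $n$ there exists a pair of polynomials $P_n \in \mathbb{P}_{n-1}$
and $Q_n \in \mathbb{P}_n$ such that $Q_n \not\equiv 0$ and the condition

$$F_n(z) = (Q_n f - P_n)(z)/W_n(z) \in H(E)$$

($F_n$ is analytic on $E$) is satisfied. Indeed, the last condition is
equivalent to a system of $2n$ linear homogeneous equations for the
$2n + 1$ coefficients of the polynomials $P_n \in \mathbb{P}_{n-1}$ and
$Q_n \in \mathbb{P}_n$ (in the case of distinct nodes those equations are
$(Q_n f - P_n)(\zeta_{k,n}) = 0$, $k = 1, 2, \dots, 2n$). Such a system
always has a nontrivial solution, and $Q_n \not\equiv 0$ in any such
solution: otherwise $P_n$, of degree at most $n - 1$, would vanish at $2n$
points (counting multiplicities) and hence identically. This proves the
existence of the required polynomials $P_n$, $Q_n \not\equiv 0$.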

We set $r_n = P_n/Q_n$. This function does not necessarily interpolate $f$
at all nodes (common zeros of $P_n$ and $Q_n$ are possible, and at such
points the interpolation may be lost). For Markov-type functions this
cannot happen, as follows from the subsequent considerations.
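
The linear system behind the multipoint Padé construction can be set up numerically. The following is a minimal sketch under stated assumptions, not the construction used in the cited papers: it takes the particular M-function $f(z) = \int_{-1}^{1} dt/(t - z) = \log\frac{z-1}{z+1}$ (so $F = [-1, 1]$), equally spaced nodes on $E = [2, 3]$, and normalizes $Q_n$ to be monic of degree $n$, which is legitimate for Markov-type functions because the zeros of $Q_n$ are then simple and lie in $F$. All function names are hypothetical.

```python
import math

def markov_f(z):
    # Cauchy transform of Lebesgue measure on F = [-1, 1], for real z > 1.
    return math.log((z - 1.0) / (z + 1.0))

def solve_linear(a, b):
    """Gaussian elimination with partial pivoting for a square system a x = b."""
    m = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(m):
        piv = max(range(col, m), key=lambda r: abs(aug[r][col]))
        aug[col], aug[piv] = aug[piv], aug[col]
        for r in range(col + 1, m):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, m + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * m
    for r in range(m - 1, -1, -1):
        s = aug[r][m] - sum(aug[r][c] * x[c] for c in range(r + 1, m))
        x[r] = s / aug[r][r]
    return x

def multipoint_pade(f, nodes):
    """Ascending coefficients (p, q): deg p <= n - 1, q monic of degree n,
    with (q f - p)(zeta) = 0 at each of the 2n distinct nodes."""
    n = len(nodes) // 2
    rows, rhs = [], []
    for zeta in nodes:
        fz = f(zeta)
        # Unknowns q_0..q_{n-1}, p_0..p_{n-1}; the monic term goes to the right side.
        rows.append([fz * zeta**i for i in range(n)] + [-(zeta**i) for i in range(n)])
        rhs.append(-fz * zeta**n)
    sol = solve_linear(rows, rhs)
    return sol[n:], sol[:n] + [1.0]

def polyval(coeffs, x):
    # Horner evaluation, coefficients in ascending order.
    result = 0.0
    for c in reversed(coeffs):
        result = result * x + c
    return result

def count_sign_changes(coeffs, a, b, samples=2001):
    values = [polyval(coeffs, a + (b - a) * i / (samples - 1)) for i in range(samples)]
    return sum(1 for u, v in zip(values, values[1:]) if u * v < 0)

n = 3
nodes = [2.0 + k / (2 * n - 1) for k in range(2 * n)]   # 2n nodes on E = [2, 3]
p, q = multipoint_pade(markov_f, nodes)
defect = max(abs(markov_f(z) - polyval(p, z) / polyval(q, z)) for z in nodes)
print("max interpolation defect at the nodes:", defect)
print("sign changes of Q_n on F:", count_sign_changes(q, -1.0, 1.0))
```

Counting sign changes of $Q_n$ on a grid in $F$ is a crude but dependency-free way to see that all $n$ poles of $r_n$ fall inside $F$ in this example; a root finder would serve equally well.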
2.1.2. Orthogonality. Hermite interpolation formula. The denominator $Q_n$
satisfies the following orthogonality conditions:

$$\int_F Q_n(x)\, x^j\, \frac{d\sigma(x)}{W_n(x)} = 0, \qquad j = 0, 1, \dots, n - 1. \tag{7}$$

The following identity, called the Hermite interpolation formula, is also
important:

$$f(z) - r_n(z) = \frac{W_n(z)}{Q_n^2(z)} \int_F \frac{Q_n^2(t)}{W_n(t)}\, \frac{d\sigma(t)}{t - z}, \qquad z \in \Omega. \tag{8}$$

The proof of (7) is obtained by integrating $z^j F_n(z)$ over a contour $C$
separating $F$ from $E$ and $\infty$. In particular, (7) implies that the
zeros of $Q_n$ are simple and belong to $F$. After that, (8) follows from
Cauchy’s integral representation for the function
$Q_n F_n \in H(\operatorname{Ext} C)$, where $\operatorname{Ext} C$ denotes
the unbounded connected component of the complement of the contour $C$.


2.1.3. Zero distribution. Balayage. Conditions (7) present a model
situation of orthogonality with varying weights on the real line. The
assertion on the asymptotics of the associated orthogonal polynomials $Q_n$
is formulated in terms of weak-* convergence of the normalized counting
measure. The counting measure of a polynomial $P$ is defined as the sum of
unit masses at the zeros of $P$ (counting multiplicities) and is denoted by
$\chi(P) = \sum_{P(\zeta) = 0} \delta(\zeta)$ ($\delta(\zeta)$ is the unit
measure supported at the point $\zeta$).

Now we assume that the interpolation table has a limit distribution (limit
density) represented by a unit positive measure $\mu$ on $E$. More
precisely, this assumption means that the sequence $\chi(W_n)/2n$ is weak-*
convergent to the measure $\mu$ as $n \to \infty$. We denote this fact by
$\frac{1}{2n}\chi(W_n) \overset{*}{\to} \mu$.

The basic fact regarding real interpolation of M-functions with free poles
is that, if the interpolation table has limit density $\mu$, then the
denominators $Q_n$ have the limit distribution $\lambda$, which is the
balayage of $\mu$ from $E$ onto $F$. Formally,

$$\frac{1}{2n}\chi(W_n) \overset{*}{\to} \mu \quad \text{implies that} \quad \frac{1}{n}\chi(Q_n) \overset{*}{\to} \lambda, \tag{9}$$

where $\lambda$ is the unit measure on $F$ defined by the condition

$$U^{\lambda}(x) - U^{\mu}(x) = C_F = \text{const}, \qquad x \in F \tag{10}$$

(by $U^{\nu}(x) = -\int \log|x - t|\, d\nu(t)$ we denote the logarithmic
potential of a measure $\nu$).
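
To make the notions of normalized counting measure and logarithmic potential concrete, here is a small numerical illustration that is not taken from the paper. It uses the Chebyshev polynomials $T_n$, whose zeros $x_k = \cos\frac{(2k-1)\pi}{2n}$ lie in $[-1, 1]$; since $\prod_k (x - x_k) = T_n(x)/2^{n-1}$, the potential of $\frac{1}{n}\chi(T_n)$ at a real point $x > 1$ approaches $\log 2 - \log(x + \sqrt{x^2 - 1})$, the potential of the arcsine (Robin) measure of $[-1, 1]$. The function names are hypothetical.

```python
import math

def chebyshev_zeros(n):
    return [math.cos((2 * k - 1) * math.pi / (2 * n)) for k in range(1, n + 1)]

def potential_of_counting_measure(zeros, x):
    """U(x) = -(1/n) * sum_k log|x - x_k| for the normalized counting measure."""
    return -sum(math.log(abs(x - t)) for t in zeros) / len(zeros)

def limit_potential(x):
    """Potential of the arcsine (Robin) measure of [-1, 1] at a real x > 1."""
    return math.log(2.0) - math.log(x + math.sqrt(x * x - 1.0))

x = 2.0
for n in (5, 20, 80):
    print(n, potential_of_counting_measure(chebyshev_zeros(n), x), limit_potential(x))
```

The two printed columns agree to many digits already for moderate $n$, which is the pointwise shadow of the weak-* convergence of the counting measures.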


2.1.4. Convergence. Upper bound for the rate on $E$. Comparing the boundary
values on $F$ and the singularities on $E$, one verifies directly that
$U^{\mu - \lambda}(z) + C_F = G^{\mu}(z) = \int g(z, t)\, d\mu(t)$ is the
Green potential of $\mu$ with respect to the domain
$\Omega = \overline{\mathbb{C}} \setminus F$ ($g(z, t)$ is Green’s function
for $\Omega$ with pole at $t$). It follows from (8) and (9) that the
interpolating sequence $r_n(z)$ associated with an interpolation table
having limit density $\mu$ satisfies

$$|f(z) - r_n(z)|^{1/n} \to \exp\{-2G^{\mu}(z)\}, \qquad \text{uniformly for } z \in \Omega \setminus E, \tag{11}$$

and, moreover,

$$\max_{x \in E} |f(x) - r_n(x)|^{1/n} \to \exp\{-2 \min_{x \in E} G^{\mu}(x)\}. \tag{12}$$
To obtain the best possible estimate from (12), we need to find a measure
$\mu$ which maximizes $w(\mu) = \min_{x \in E} G^{\mu}(x)$ in the class of
all unit measures $\mu$ on $E$. The problem is well known in classical
complex analysis; its solution is $\mu = \lambda_E$, Green’s equilibrium
measure on $E$ relative to $\Omega = \overline{\mathbb{C}} \setminus F$. A
characteristic property of this measure is

$$U^{\lambda}(x) - U^{\mu}(x) = C_E = \text{const}, \qquad x \in E. \tag{13}$$

Relations (10) and (13) together mean that the pair of measures
$\lambda_F = \lambda$ and $\mu = \lambda_E$ forms the equilibrium
distribution for the condenser $(E, F)$. The capacity $C(E, F)$ of the
condenser is defined as $C(E, F) = 1/w$. In terms of the equilibrium
constants in (10), (13) we have $w = C_F - C_E$. From here, since
$e^{-2w} = e^{-2/C(E,F)} = \rho(f)^2$,

$$\limsup_{n \to \infty} \rho_n(f)^{1/n} \le \rho(f)^2.$$

We note that the equilibrium problem related to this situation is the
standard equilibrium problem for a plane condenser (a pair of disjoint
compact sets) and, thus, we do not seem to have a nonstandard
$S$-equilibrium problem here. The reason is that on the real axis any
equilibrium potential is automatically symmetric with respect to the real
axis (the normal derivatives of the potential in two opposite directions
are equal). So the associated $S$-property actually follows from the
symmetry of the situation.

2.1.5. Lower bound and strong asymptotics. It follows directly from (8)
that $f - r_n$ is real and has exactly $2n$ zeros on $E$. Thus, it makes
$2n + 1$ oscillations on $E$, whose amplitudes are asymptotically estimated
by (11). These estimates make it possible to use the classical inequality
of Ch. de la Vallée-Poussin [60] (instead of Gonchar’s general complex
argument mentioned above) to obtain the corresponding lower bound.

Moreover, under the Szegő condition [59] on the measure $\sigma$ it is
possible to slightly modify the interpolation nodes (the measure $\mu$) in
such a way that the difference $f - r_n$ is asymptotically equioscillating.
Then an application of de la Vallée-Poussin’s estimates yields the strong
asymptotics for the error of the best approximations,

$$\rho_n(f) = \gamma(\sigma, E)\, \rho(f)^{2n} (1 + \varepsilon_n), \tag{14}$$

where $\varepsilon_n \to 0$ as $n \to \infty$ and $\gamma(\sigma, E)$ is an
explicit constant (the same is true for the error of interpolation
$\max_{x \in E} |f(x) - r_n(x)|$).

Details related to this and other similar results may be found in the
book by H. Stahl and V. Totik [56] on general orthogonal polynomials. One
example of a more complex (but still Markov) situation, presented by
Stahl’s theorem on the rational approximation of $\sqrt{x}$ on $[0, 1]$, is
in Section 2.2 below.

For a rather general class of analytic functions (including functions with
complex branch points) A. Aptekarev [1] proved a theorem on the exact
constants of approximation by rational functions of order $\le n$. In
particular, he obtained a relation of type (14) for the functions from this
class. His theorem yields the following formula of strong asymptotics for
the error $\rho_n = \rho_n(e^{-x})$ of the best uniform approximation on
the semi-axis $[0, +\infty)$ to the function $e^{-x}$ by rational functions
of order $\le n$:

$$\rho_n = 2 v^{n + 1/2} (1 + o(1)), \qquad n \to \infty, \tag{15}$$

where $v$ is the so-called Halphen constant (see [29], [35]). Formula (15)
proves a conjecture of Magnus [35] on the exact constant of rational
approximation of the function $e^{-x}$ on the semi-axis $[0, +\infty)$.
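
For orientation, the Halphen constant can be evaluated numerically. The sketch below relies on a characterization quoted from the literature on the 1/9 problem rather than from the text above, so it should be read as an assumption: $v$ is taken to be the unique positive root of $\sum_{n \ge 1} a_n v^n = 1/8$ with $a_n = \left|\sum_{d \mid n} (-1)^d d\right|$. The series is truncated and the root is found by bisection; the function names are hypothetical.

```python
def divisor_coefficient(n):
    """a_n = |sum over divisors d of n of (-1)^d * d|."""
    return abs(sum((-1) ** d * d for d in range(1, n + 1) if n % d == 0))

def halphen_series(v, terms=80):
    # Truncated series; for v near 0.1 the neglected tail is far below rounding error.
    return sum(divisor_coefficient(n) * v ** n for n in range(1, terms + 1))

def halphen_constant(tol=1e-15):
    lo, hi = 0.05, 0.2   # the series is increasing for v > 0 and crosses 1/8 in between
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if halphen_series(mid) < 0.125:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def leading_term(n, v):
    """Leading term 2 * v**(n + 1/2) of the asymptotic formula (15)."""
    return 2.0 * v ** (n + 0.5)

v = halphen_constant()
print(v, 1.0 / v)          # 1/v is roughly 9.29, which explains the name "1/9 problem"
print(leading_term(10, v))
```

The second printed number only evaluates the leading term of (15); it says nothing about the size of the $o(1)$ correction for a given $n$.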

The method in [1] was based on a study of strong asymptotics for complex
orthogonal polynomials using steepest descent for the matrix
Riemann-Hilbert representation of such polynomials. In this connection see
also [4].

See also [38], where both steepest descent for matrix Riemann-Hilbert
problems and WKB are used.


2.1.6. Generalization. Equilibrium measure. Formulae (9)-(10) were proven
in the original papers [19] and [17] only for the case $\mu = \lambda_E$,
which was enough to conclude the proof of Theorem 2.

More general orthogonal polynomials $Q_n$, defined by

$$\int_F Q_n(x)\, e^{-2n\varphi_n(x)}\, x^j\, d\sigma(x) = 0, \qquad j = 0, 1, \dots, n - 1, \tag{16}$$

were studied in the paper [26] by A. A. Gonchar and the author, with the
following result (simplified version).


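Theorem 3. Let $\sigma'(x) = d\sigma/dx > 0$ almost everywhere on $F$ and
suppose that the sequence $\varphi_n(x)$ converges to $\varphi(x)$
uniformly on $F$. Then $\frac{1}{n}\chi(Q_n) \overset{*}{\to} \lambda$,
where $\lambda = \lambda_{\varphi}$ is the equilibrium measure of $F$ in the
external field $\varphi$, which is a unit measure on $F$ defined by

$$U^{\lambda}(x) + \varphi(x) = C = \text{const}, \quad x \in \operatorname{supp} \lambda, \qquad U^{\lambda}(x) + \varphi(x) \ge C, \quad x \in F. \tag{17}$$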


This theorem was, probably, the first general result on zero distribution 
of orthogonal polynomials with varying weights. 


2.2. Stahl’s theorem on approximation of $|x|$ on $[-1, 1]$. The problem
of estimates for $\rho_n = \rho_n(|x|, [-1, 1])$ was introduced by
D. J. Newman in 1964 [39], who proved that
$e^{-c_1\sqrt{n}} \le \rho_n \le e^{-c_2\sqrt{n}}$ with some
$c_1 \ge c_2 > 0$.

It is easy to see that $\rho_{2n} = \rho_n(\sqrt{x}, [0, 1])$, so the
problem is reduced to the approximation of $\sqrt{x}$ on $[0, 1]$.

Representing the function $\sqrt{z}$ in the domain
$\{|z| < 2\} \setminus (-2, 0]$ by the Cauchy integral and defining

$$f(z) := \frac{1}{\pi} \int_{[-1,0]} \frac{\sqrt{-t}}{t - z}\, dt, \qquad z \in \Omega = \overline{\mathbb{C}} \setminus [-1, 0], \tag{18}$$

we find that $g(x) = \sqrt{x} - f(x)$ is analytic on $[0, 1]$ and,
therefore, its rational approximations converge to $g$ geometrically. This,
together with Newman’s estimate, implies that
$\rho_n(\sqrt{x}, [0, 1])/\rho_n(f, [0, 1]) \to 1$ as $n \to \infty$.

So the problem is reduced to the study of the best rational approximation
of the M-function $f$ on $E = [0, 1]$. Basically, we can use the method
described in Section 2.1 above, but this method has to be modified.

The problem now is more difficult than the problems discussed in
Section 2.1. The condenser $(E, F)$ associated with the current situation,
$E = [0, 1]$, $F = [-1, 0]$, is degenerate: since the plates $E$ and $F$
have a common point, the equilibrium distribution $\lambda_E - \lambda_F$
for such a condenser does not exist (collapsing situation).
Stahl used the condenser with a logarithmic external field on the plate
$F$ (which comes from the term $\sqrt{-t}$ in (18)). The external field
prevents the equilibrium distribution from collapsing, and the weighted
equilibrium distribution may be used to define an optimal interpolation
table. Stahl was able to obtain strong asymptotics for the associated
orthogonal polynomials. Then he obtained strong asymptotics for the error
of approximation. As a result he proved in [55] the following remarkable
theorem.

Theorem 4. For $\rho_n = \rho_n(|x|, [-1, 1])$ we have

$$\lim_{n \to \infty} \rho_n\, e^{\pi\sqrt{n}} = 8. \tag{19}$$

The result was conjectured by R. Varga [61] on the basis of numerical
experiments. The correct constant $c_1 = c_2 = \pi$ was found earlier by
N. S. Vyacheslavov [62].
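
To get a feeling for the scale in (19), the leading-order expression $8 e^{-\pi\sqrt{n}}$ can simply be tabulated. The sketch below evaluates only this asymptotic expression; it does not compute best approximations, and the helper names are hypothetical.

```python
import math

def stahl_asymptotic_error(n):
    """Leading-order value 8 * exp(-pi * sqrt(n)) suggested by (19)."""
    return 8.0 * math.exp(-math.pi * math.sqrt(n))

def order_needed(target):
    """Smallest n with 8 * exp(-pi * sqrt(n)) <= target (asymptotic formula only)."""
    n = 1
    while stahl_asymptotic_error(n) > target:
        n += 1
    return n

for n in (4, 16, 64):
    print(n, stahl_asymptotic_error(n))
print(order_needed(1e-6))
```

Because the exponent grows like $\sqrt{n}$ rather than $n$, the decay is slower than geometric, in agreement with Newman's two-sided estimate.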

2.3. Some problems and conjectures by Gonchar related to the
$\rho^2$-theorem. A broader context related to the $\rho^2$-theorem is the
general problem of characterizing classes of analytic functions through the
rate of convergence of their best rational approximations.

The corresponding problem for polynomial approximations essentially admits
a general solution and the associated theory is well known. For rational
approximations the situation is more complicated. It is usually difficult
to find a criterion in terms of best rational approximation, since direct
and inverse theorems are mostly far away from each other (cf. [22], [23],
[24]).

A typical example is related to characterizing the class of functions with
supergeometric rate of convergence of best rational approximations,

$$\lim_{n \to \infty} \rho_n(f, E)^{1/n} = 0. \tag{20}$$

A direct theorem by Ch. Pommerenke [44] asserts that, if
$f \in H(\overline{\mathbb{C}} \setminus e)$ and $\operatorname{cap}(e) = 0$,
then (20) is valid for any $E \subset \overline{\mathbb{C}} \setminus e$.
The converse is not true. Basically, knowing (20) we cannot say anything
about the set of singularities of $f$.

On the other hand, Gonchar proved in [12] that (20) implies that $f$ is
quasianalytic (there is a uniqueness theorem for such functions similar to
the one for analytic functions). He also proved [12] that the function $f$
is single-valued in all of its Weierstrass domain if (20) is satisfied.
There are more theorems by Gonchar of that kind; see [13], [14] for details.

Soon after the $\rho^2$-theorem for M-functions was proven, Gonchar raised
the following general question: for what kind of functions does the limit
of $\rho_n(f, E)^{1/n}$ as $n \to \infty$ exist, and is it positive? In
other words, which functions have regular behavior of the sequence of their
best rational approximations? His basic idea was that all the “natural”
functions are regular and for any such function we have

$$\lim_{n \to \infty} \rho_n(f, E)^{1/n} = \rho(f, E)^2 \tag{21}$$

for any continuum $E$ in the domain of the function (see [17], [18], [25],
[28]). In other words, if $\lim \rho_n(f, E)^{1/n}$ exists then it is equal
to $\rho(f)^2$. All subsequent results seem to confirm the conjecture, but
it is not clear how it may be proved.
Anyway, Theorem 2 means that Markov-type functions are regular ((21) is
satisfied under mild restrictions on the measure), which was also an
important argument in favor of Gonchar’s $\rho^2$-conjecture that any
function $f \in \mathcal{A}(\overline{\mathbb{C}} \setminus e)$ with a
finite set $e$ is a regular function (a stronger version is contained in
the Gonchar-Stahl theorem). In particular, he also conjectured that all
algebraic functions are regular and for any $E$ free of singularities we
have the stronger estimates

$$0 < C_1(f, E) \le \frac{\rho_n(f, E)}{\rho(f, E)^{2n}} \le C_2(f, E) \tag{22}$$

(this conjecture of Gonchar seems to be still open in general, even though
it appears to be within the range of existing methods).

Another general conjecture by Gonchar was that, if a function has a worse
than $\rho^2$-rate of best rational approximation and
$\limsup_{n \to \infty} \rho_n(f, E)^{1/n} > \rho(f, E)^2$, then it is “not
regular” and there is another subsequence where the rate is better than
“normal”; that is,
$\liminf_{n \to \infty} \rho_n(f, E)^{1/n} < \rho(f, E)^2$. In particular,
his conjecture was that

$$\liminf_{n \to \infty} \rho_n(f, E)^{1/n} \le \rho(f, E)^2 \quad \text{for any } f \in H(E). \tag{23}$$

Later this conjecture was proved by O. G. Parfenov [42] and V. A. Prokhorov
[45]-[47] who also obtained a stronger inequality

$$\limsup_{n \to \infty} \prod_{k=1}^{n} \rho_k(f, E)^{1/n^2} \le \rho(f, E). \tag{24}$$


The proofs of the theorems by Parfenov [42] and Prokhorov [45]-[47] were
based on a combination of interpolation with fixed poles and theorems on
singular numbers of Hankel operators. This is essentially another important
direction in approximation theory, in many ways different from the one under
consideration, and we do not go into further details.

3. Stahl’s theorem on Padé approximants

So far the rate of convergence of best rational approximations was
discussed. Now we pass to the convergence properties of the approximating
functions; see (11) and (12) as examples. The construction of near-best
rational approximations $r_n(z)$ to $f$ in the context of Theorem 1 may also be
arranged in such a way that these functions converge to $f$ uniformly on
compact subsets of the whole extremal domain of analyticity of $f$.

The convergence problem for rational approximations is more convenient
to discuss for the case of (diagonal) Padé approximants, the best local
rational approximants to a power series. It is also convenient to select an
interpolation point at infinity, so that all branch points of the function are
finite.


3.1. Padé approximants for functions with branch points. Let

$$f(z) = \sum_{n=0}^{\infty} \frac{f_n}{z^n} \tag{25}$$

be a function analytic at infinity. Padé approximants $\pi_n(z) = (P_n/Q_n)(z)$
to $f$ are defined by the condition

$$R_n(z) := (Q_n f - P_n)(z) = O\left(1/z^{n+1}\right), \quad z \to \infty, \tag{26}$$

where $P_n, Q_n \in \mathbb{P}_n$ and $Q_n \not\equiv 0$ (see [6] for details). In connection with
best rational approximations we note that the $\pi_n(z)$ are limits of the best
approximations on $D_R = \{z : |z| \ge R\}$ as $R \to \infty$ [65]. In this sense Padé
approximants are the local version of best rational ones. The function $R_n$ is
called the remainder.

3.1.1. Markov theorem. An old classical convergence theorem proved by
A. A. Markov [36] in 1895 asserts that, if $f(z)$ is an M-function (5), then
the associated sequence $\pi_n(z)$ converges to $f$ uniformly on compact subsets
of the complement of $F$ (the minimal interval containing the support $S_\sigma$
of the measure $\sigma$). Note that $f$ is holomorphic in the domain
$\Omega = \overline{\mathbb{C}} \setminus S_\sigma$, which may be larger than $\overline{\mathbb{C}} \setminus F$. The functions $\pi_n$ may
have poles in this larger domain, but they still converge there in capacity.

The fact that Padé denominators $Q_n$ are orthogonal polynomials with
respect to $\sigma$ had been discovered earlier, in 1855, by P. L. Chebyshev [7].
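
The link between condition (26) and orthogonality can be checked on a small example. The sketch below is an illustration only and is not taken from the source: it assumes a function of the form $f(z) = \sum_{k \ge 0} c_k / z^{k+1}$ with $c_k = \int x^k \, d\sigma(x)$, takes for $\sigma$ the Lebesgue measure on $[-1, 1]$ (a choice made here for convenience), and uses hypothetical helper names. Condition (26) then reduces to the linear system $\sum_j q_j c_{j+m} = 0$, $m = 0, \dots, n-1$, for the coefficients $q_j$ of $Q_n$, which is solved in exact rational arithmetic.

```python
from fractions import Fraction


def legendre_moments(count):
    """Moments c_k = integral of x**k over [-1, 1] (Lebesgue measure), k < count."""
    return [Fraction(2, k + 1) if k % 2 == 0 else Fraction(0) for k in range(count)]


def monic_pade_denominator(moments, n):
    """Coefficients q_0..q_n (ascending powers, q_n = 1) of the monic Q_n with
    sum_j q_j * c_{j+m} = 0 for m = 0..n-1, which is condition (26) for
    f(z) = sum_k c_k / z**(k+1). Needs at least 2n moments."""
    # Augmented Hankel system: H[m][j] = c_{j+m}, right-hand side -c_{n+m}.
    rows = [[moments[j + m] for j in range(n)] + [-moments[n + m]] for m in range(n)]
    # Gauss-Jordan elimination; a pivot exists when the Hankel matrix is
    # nonsingular (true for moments of a positive measure with infinite support).
    for col in range(n):
        pivot = next(r for r in range(col, n) if rows[r][col] != 0)
        rows[col], rows[pivot] = rows[pivot], rows[col]
        inv = 1 / rows[col][col]
        rows[col] = [x * inv for x in rows[col]]
        for r in range(n):
            if r != col and rows[r][col] != 0:
                factor = rows[r][col]
                rows[r] = [x - factor * y for x, y in zip(rows[r], rows[col])]
    return [rows[m][n] for m in range(n)] + [Fraction(1)]


if __name__ == "__main__":
    for n in (1, 2, 3):
        print(n, monic_pade_denominator(legendre_moments(2 * n), n))
```

For this choice of measure the computed denominators coincide with the monic Legendre polynomials, in agreement with Chebyshev's observation.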

3.1.2. Nuttall’s minimal capacity conjecture. One of the main problems in
the theory of Padé approximants in the period 1960-1970 was the convergence
problem for functions with branch points. If element (25) at infinity
represents a function $f \in \mathcal{A}(\overline{\mathbb{C}} \setminus e)$, where $e$ is, say, a finite set of branch
points, then Padé approximants to $f$ may converge to $f$ only in a domain
where $f$ is single-valued. What is actually the domain of convergence?

The first results on the convergence of Padé approximants for functions
with some special type of branch points were obtained by J. Nuttall who
also made the following conjecture (see [40], [41]). Let $f \in \mathcal{A}(\overline{\mathbb{C}} \setminus e)$ where
$e$ is a finite set and

$$\mathcal{F} = \{F \subset \mathbb{C} : f \in H(\overline{\mathbb{C}} \setminus F)\} \tag{27}$$

be the set of compact cuts $F$ which make $f$ single-valued. Let, further,
$F_f \in \mathcal{F}$ be the cut of minimal capacity

$$\operatorname{cap}(F_f) = \min_{F \in \mathcal{F}} \operatorname{cap}(F). \tag{28}$$

Nuttall’s main conjecture was that the sequence $\{\pi_n\}$ converges to $f$ in
capacity in the complement to $F_f$:

$$\pi_n \overset{\mathrm{cap}}{\longrightarrow} f, \qquad z \in \overline{\mathbb{C}} \setminus F_f.$$

He also formulated a conjecture on strong asymptotics of Padé denominators,
which he proved in some particular situations [40], [41].

3.2. Stahl’s theorem. A general theorem on the convergence of Padé
approximants including, in particular, Nuttall’s conjecture, was proven by
H. Stahl [50]-[53]. Here is the original statement of the theorem where the
compact set $F_f$ of minimal capacity is characterized equivalently in terms
of the $S$-property.

Theorem 5. Let $e$ be a compact set of zero (logarithmic) capacity, $\operatorname{cap} e = 0$,
and let $f \in \mathcal{A}_e = \mathcal{A}(\overline{\mathbb{C}} \setminus e)$ be not single-valued in $\overline{\mathbb{C}} \setminus e$. Then the following
assertions (A), (B) and (C) hold.

(A) There exists a unique compact set $F = F_f$ in the plane, which is the
union of analytic arcs (up to subsets of capacity zero), with the following
properties. The complement of $F$ is connected, $f$ is single-valued in $\overline{\mathbb{C}} \setminus F$
(so that $F \in \mathcal{F}(f)$), the jump of $f$ across any arc in $F$ is not identically zero
and, finally, the equality

$$\frac{\partial g}{\partial n_1}(z) = \frac{\partial g}{\partial n_2}(z), \qquad z \in F^0 \tag{29}$$

(called the $S$-property) holds for the Green function $g = g(z, \infty)$ of $\overline{\mathbb{C}} \setminus F$ with
pole at infinity, where $F^0$ is the union of the open parts of the arcs
constituting $F$ ($n_1, n_2$ are two oppositely directed normals to $F^0$ at the point $z$).

(B) For the Padé denominators $Q_n$ associated with $f$ we have
$\frac{1}{n} \mathcal{X}(Q_n) \overset{*}{\to} \lambda$, where $\lambda = \lambda_F$ is the Robin measure of the compact set $F$.

(C) The sequence of Padé approximants $\pi_n = P_n/Q_n$ associated with $f$
converges in capacity to the function $f$ inside (i.e., on compact subsets of)
the domain $D := \overline{\mathbb{C}} \setminus F$.


The exact rate of convergence in capacity was also included in the
theorem, but our further discussion is related to assertions (A) and (B). Part
(C) is essentially a corollary of (B).
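
Assertion (B) can be illustrated numerically in the simplest classical situation. The following sketch is an illustration under stated assumptions, not part of the source: for $f(z) = 1/\sqrt{z^2 - 1}$ (branch set $e = \{-1, 1\}$) the Padé denominators at infinity are the Chebyshev polynomials $T_n$, the extremal compact set is $[-1, 1]$, and its Robin measure is the arcsine distribution $dx / (\pi \sqrt{1 - x^2})$. The function names are hypothetical; the code compares the normalized zero counting measure of $T_n$ with the arcsine distribution function.

```python
import math


def chebyshev_zeros(n):
    """Zeros of the Chebyshev polynomial T_n: cos((2k - 1) * pi / (2n)), k = 1..n."""
    return [math.cos((2 * k - 1) * math.pi / (2 * n)) for k in range(1, n + 1)]


def empirical_cdf(points, t):
    """Mass that the normalized counting measure of `points` assigns to (-inf, t]."""
    return sum(1 for x in points if x <= t) / len(points)


def arcsine_cdf(t):
    """Distribution function of the Robin (equilibrium) measure of [-1, 1]."""
    return 1.0 - math.acos(t) / math.pi


if __name__ == "__main__":
    zeros = chebyshev_zeros(300)
    for t in (-0.5, 0.0, 0.5, 0.9):
        print(t, empirical_cdf(zeros, t), arcsine_cdf(t))
```

Since the zeros are equally spaced in the angle variable, the two distribution functions differ by less than $1/n$ at every point, which is the weak-* convergence in (B) for this example.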

The most important part of the theorem is part (B). Rather sophisticated
and entirely original potential theoretic methods were used in this part of
the proof. The starting point was the following orthogonality condition for
Padé denominators $Q_n$:

$$\oint_F Q_n(z)\, z^k f(z)\, dz = 0, \qquad k = 0, 1, \dots, n-1, \tag{30}$$

where integration is taken over any system of contours separating F from 
infinity. Very important is that Stahl’s proof was the first instance of the ef¬ 
fective use of complex orthogonality and this was a significant breakthrough 
in the theory. 

Another interesting fact about Stahl’s proof is that additional assumptions
do not lead to any simplifications. The proof of the theorem for a
set $e$ with three branch points (not on a line) is identical to the original
proof for sets $e$ of capacity zero. Additional assumptions on the character of
the branch points do not bring any simplifications either (one exception is
a square root of a rational function). It seems that this part of the theorem
does not have any simple particular cases with branch points not on
a line. The case when the branch points are on a line may be essentially
viewed as part of the Markov theorem (some additional assumptions are
formally needed).

It is not possible here to go into any further details related to this part
of the proof. We have in mind that the Robin measure $\lambda = \lambda_F$ of the
extremal compact set $F = F_f$ represents the limit zero distribution of the
Padé denominators and concentrate on the further characterization of $F$
(geometric component of the problem). Part (A) of the theorem essentially
defines $F$ by the $S$-property.

Recall that the similar part in the proof of the $\rho^2$-theorem was represented
by a Green equilibrium problem. In more general situations, the geometric
component may be represented by a more general kind of $S$-equilibrium
problem. Then existence itself would be a problem. Constructive solution
is another problem. To some extent, further progress in the theory depends
on the development of the geometric component of the method.


3.3. ‘Geometry’ of Stahl’s theorem. The geometric part in Stahl’s theorem
is a particular case of a general $S$-equilibrium problem; it is the case of a
single logarithmic potential and the zero external field. The extremal compact
set $F_f$ in such settings always exists and it has a comparatively simple
constructive characterization. A similar characterization (in case of existence)
may be obtained in more general cases of a pure logarithmic or Green
potential and an external field which is harmonic outside of a set of capacity
zero. Other cases are essentially open; see [49]. Next we discuss the case
when the set $e$ of singularities of $f$ is finite (and there is no external field).
Formulae related to the case are explicit. At the same time the case is still
representative and may shed light on the nature of the problem at large.


3.3.1. Quadratic differential. Let $e = \{a_1, a_2, \dots, a_p\}$ be a finite set of distinct
points; we denote $A(z) = (z - a_1)(z - a_2) \cdots (z - a_p)$. Let $f \in \mathcal{A}_e = \mathcal{A}(\overline{\mathbb{C}} \setminus e)$
and $F_f$ be the associated extremal compact. In the sequel we call it the
$S$-compact or Stahl compact. The following characterization of $F_f$ is valid.

There exists a polynomial $V_f$

$$V_f(z) = (z - v_1)(z - v_2) \cdots (z - v_{p-2}), \quad \text{where } v_j = v_j(f),$$

depending on $f$ and $e$, of degree $p - 2$, such that the $S$-compact set $F_f$ is the
union of some critical trajectories of the quadratic differential $-(V/A)\,(dz)^2$,
where $V = V_f$.

The assertion follows from Stahl’s results [50]-[52]. However, the state¬ 
ment is close to some traditional theorems in geometric function theory [58] . 
An alternative proof based on a Max-Min energy problem was presented 
in [43]; see the review [49] for more details. 

In addition, the following condition is satisfied: $-(V/A)\,(dz)^2$ is a
quadratic differential with closed trajectories (in the terminology of [58]). In our
particular case it means that all its trajectories, defined by the inequality
$-(V(z)/A(z))\,(dz)^2 > 0$, are either closed contours or critical analytic arcs
connecting some pair of zeros of $AV$.

Moreover, the function $\sqrt{V(z)/A(z)}$ has a holomorphic branch in $\Omega = \overline{\mathbb{C}} \setminus F_f$
and the Green function $g$ for $\Omega$ with pole at infinity can be written as

$$g(z) = \operatorname{Re} G(z), \qquad G(z) = \int_a^z \sqrt{V(t)/A(t)}\, dt \quad (a \in e), \tag{31}$$

(the branch of the root is such that $g(z) = \log|z| + o(1)$ at infinity). The
$S$-property (29) of the Green function follows directly from this representation.
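
As a minimal worked example (chosen here for illustration and not taken from the source), consider the simplest case $p = 2$ with the assumed branch set $e = \{-1, 1\}$, for instance $f(z) = 1/\sqrt{z^2 - 1}$. Then $V$ has degree $p - 2 = 0$, so $V \equiv 1$, and the formulas above reduce to classical ones; the branch of the root is taken positive for $z > 1$.

```latex
\[
A(z) = z^2 - 1, \qquad V(z) \equiv 1, \qquad
G(z) = \int_{1}^{z} \frac{dt}{\sqrt{t^2 - 1}} = \log\bigl(z + \sqrt{z^2 - 1}\bigr),
\qquad
g(z) = \log\bigl|z + \sqrt{z^2 - 1}\bigr| .
\]
% On the interval z = x in (-1, 1), with real dx, the differential is positive:
\[
-\frac{V(x)}{A(x)}\,(dx)^2 = \frac{(dx)^2}{1 - x^2} > 0,
\qquad\text{so}\qquad F_f = [-1, 1].
\]
```

In this example $g$ vanishes on $[-1, 1]$ and is symmetric with respect to the real axis, so the two normal derivatives at any point of $(-1, 1)$ coincide, which is the $S$-property.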

Representation (31) establishes a one-to-one correspondence between
$S$-compacta $F_f$ and polynomials $V_f$. Zeros of $V_f$ may serve as coordinates
of $F_f$.

3.3.2. Family of polynomials $V_f$, $f \in \mathcal{A}_e$. The problem of a constructive
determination of the compact $F_f$ for a given $f$ has two components. This
compact depends, first of all, on the branch set $e$ of the function $f \in \mathcal{A}_e$. It
depends also on the branch type of the function (determined by indicating
the loops along which the analytic continuation of the original element given
at the infinity point leaves it unchanged). It is convenient to separate the
two dependencies by introducing the family of compacta $F_f$ associated with
all functions $f \in \mathcal{A}_e$ having a fixed set $e$ of branch points.

It is not difficult to prove that this family is finite. The number of its 
elements depends on the number of points in the set e and their configuration 
(we do not discuss calculation of this number). Now we concentrate on 
characterization of this family. 

Since each compact $F_f$ is uniquely defined by the associated polynomial $V_f$,
the whole family $F_f$, $f \in \mathcal{A}_e$, may be described in terms of the associated family
of polynomials $V_f$, which we denote by

$$\mathcal{V}(e) = \{V_f : f \in \mathcal{A}_e\}.$$

A polynomial $V \in \mathcal{V}(e)$ is determined by its roots, that is, by $p - 2$ complex
numbers $v_j$ playing the role of coordinates. We may ask, therefore, if some kind
of equations may be written in terms of the coordinates $v_j$. Some equations
may, indeed, be derived from the characterization of $S$-compacta as critical
trajectories of quadratic differentials (see Section 3.3.1 above). Those
equations (written in terms of periods of quadratic differentials) belong to
a well known class of equations and such equations are usually not easy to
deal with. In particular, there is a difficult combinatorial element in their
structure and the detailed analysis of the situation may not be presented
here. Below we outline briefly two possible ways the problem may be
approached without going into all the details.

In the next subsection, we introduce a family of hyperelliptic Riemann
surfaces associated with the family $F_f$, $f \in \mathcal{A}_e$, of Stahl compacta. In terms
of this family of Riemann surfaces we define a mapping in the set of monic
polynomials of degree $p - 2$. Then polynomials $V \in \mathcal{V}(e)$ are defined as
fixed points of this mapping. It seems that nothing related to this approach
has been published so far.

It is possible that a natural way to generalize Stahl’s theorem to
Hermite-Padé approximation goes through a proper generalization of this approach.
Anyway, the associated generalization of the family of Riemann surfaces is
already known at least for simple situations.

In Section 3.4 we discuss an approach to the problem of constructive
description of $S$-compacta based on the embedding of the set of Robin measures
associated with compacta $F_f$, $f \in \mathcal{A}_e$, into a larger space of probability
measures in the plane, which we call $e$-critical measures. Those measures
constitute a connected finite dimensional variety and its structure may help to
better understand the structure of the discrete set of Robin measures for
$F_f$, $f \in \mathcal{A}_e$ (see [37]).

Later, in Section 4 we also use critical measures to study the problem of
stability of convergence in Stahl’s theorem under variations of the function
$f$ preserving the set of branch points.



3.3.3. Family of Riemann surfaces $\mathcal{R}_f$, $f \in \mathcal{A}_e$. The $S$-property (29) is
essentially equivalent to the fact that the real Green function $g(z)$ of the
domain $\Omega = \overline{\mathbb{C}} \setminus F_f$ has a harmonic extension to a hyperelliptic Riemann
surface $\mathcal{R} = \mathcal{R}_f$, which may be defined as the Riemann surface of the function
$\sqrt{V/A}$ with $V = V_f$. We interpret $\mathcal{R}_f$ in a standard way as a two-sheeted
branched covering over $\overline{\mathbb{C}}$. Formula (31) provides a constructive
form of this extension.

Recall that on any hyperelliptic Riemann surface $\mathcal{R}$ there exists a unique
function $g = g_{\mathcal{R}} : \mathcal{R} \to \mathbb{R}$, which is uniquely defined as a harmonic function on
the finite part of $\mathcal{R}$ with asymptotics $g(z) = \log|z| + o(1)$ as $z \to \infty^{(1)}$ and
$g(z) = -\log|z| + o(1)$ as $z \to \infty^{(2)}$ and with normalization
$g(z^{(1)}) + g(z^{(2)}) = 0$. We call this function $g = g_{\mathcal{R}}$ the $g$-function for the
Riemann surface $\mathcal{R}$. The continuation of the Green function $g(z)$ from the domain
$\overline{\mathbb{C}} \setminus F_f$ with $f \in \mathcal{A}_e$ to the Riemann surface $\mathcal{R}_f$ is exactly the $g$-function for $\mathcal{R}_f$.

Consequently, the complex Green function $G$ in (31) has a (multivalued)
analytic continuation to $\mathcal{R}$, which is a standard third kind Abelian integral
on $\mathcal{R}$ with (logarithmic) poles at $\infty^{(1)}$ and $\infty^{(2)}$ and divisor $1, -1$ (we call
it the $G$-function for $\mathcal{R}$).

The representation $G'(z) = \sqrt{V(z)/A(z)}$, where $V = V_f$, asserted in (31) for
$z \in \overline{\mathbb{C}} \setminus F_f$ is valid for $z \in \mathcal{R}$. The extremal compact set $F_f$ is the projection
of the zero level $\{\zeta \in \mathcal{R} : g(\zeta) = 0\}$ of the $g$-function onto the (extended) plane
$\overline{\mathbb{C}}$.

Now, together with the collection of $S$-compacta and the associated family
of polynomials $\mathcal{V}(e)$ we also have the family of Riemann surfaces
$\mathcal{R}(e) = \{\mathcal{R}_f : f \in \mathcal{A}_e\}$. Next, we will obtain a representation of $\mathcal{V}(e)$ in terms of
this family.

Recall that we begin our constructions with a fixed polynomial
$A(z) = z^p + \dots$ having simple roots. Next, consider a variable polynomial
$V(z) = z^{p-2} + \dots$ (at the moment we do not have to relate $V$ to the constructions
above). However, assume for now that the zeros of $V$ are simple and that $A$
and $V$ do not have common zeros. Then the Riemann surface of the function
$\sqrt{V/A}$ is a generic hyperelliptic Riemann surface of genus $p - 2$ having $2p - 2$
quadratic branch points at the zeros of $AV$. It is well known that the $G$-function
for such a surface may be written in the form

$$G(z) = \int_a^z \frac{W(t)\, dt}{\sqrt{A(t)V(t)}}, \quad \text{where } W(z) = z^{p-2} + \dots \tag{32}$$

($a$ is a root of $A$). The polynomial $W$ is uniquely determined by the polynomial
$AV$. Since $A$ is fixed, this defines a mapping $\Phi : V \to W$. It is so far
defined under the assumption that the zeros of $V$ are simple and different from
the roots of $A$, but the mapping has a continuous extension to the whole space
$\widehat{\mathbb{P}}_{p-2}$ of monic polynomials of degree $p - 2$ (actually, we need only the
restriction of the mapping $\Phi : \widehat{\mathbb{P}}_{p-2} \to \widehat{\mathbb{P}}_{p-2}$ to the space of polynomials with
zeros in the convex hull of the roots of $A$).

It follows from (32) that $G' = W/\sqrt{AV}$. On the other hand (we return to
the original settings), if $V \in \mathcal{V}(e)$ then we have $G'(z) = \sqrt{V(z)/A(z)}$ according
to (31). Since $\sqrt{V/A} = V/\sqrt{AV}$, combining the two representations we obtain
$W = V$. In other words, $V \in \mathcal{V}(e)$ implies that the polynomial $V$ is a fixed point of the
mapping $\Phi$. Reciprocally, any fixed point of $\Phi$ is in $\mathcal{V}(e)$ and, therefore,
$\mathcal{V}(e)$ is equivalently defined as the set of fixed points of the mapping $\Phi$.

It is generally possible that the polynomial $V$ has common zeros with $A$. Then
those common zeros are canceled in the ratio $V(z)/A(z)$ and the problem is
reduced to a similar one with a smaller set $e$ (of roots of $A$). Such a reduction
would not be a significant event. For instance, let $p = 3$ and let the roots of $A$
be collinear. Then the root of $V$ will cancel the middle root of $A$ and the
problem reduces to the one with $p = 2$. Cancellations of the other two roots
of $A$ are banned by the assumption that all roots of $A$ are branch points of the
functions $f \in \mathcal{A}_e$.
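
A sketch of this reduction for a concrete configuration (the points $-1, 0, 1$ are an assumption made here only for illustration): take $e = \{-1, 0, 1\}$, so that $p = 3$ and $V$ has a single root $v_1$, which must coincide with the middle point.

```latex
\[
A(z) = z\,(z^2 - 1), \qquad V(z) = z - v_1 \ \text{ with } \ v_1 = 0,
\qquad
\frac{V(z)}{A(z)} = \frac{z}{z\,(z^2 - 1)} = \frac{1}{z^2 - 1}.
\]
% The quadratic differential is the same as for the two-point set {-1, 1}:
\[
-\frac{V(z)}{A(z)}\,(dz)^2 = -\frac{(dz)^2}{z^2 - 1},
\qquad F_f = [-1, 1] \ni 0 .
\]
```

The middle point $0$ stays on $F_f$, but it no longer appears in the differential, and the Riemann surface of $\sqrt{V/A}$ has only two branch points.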

Reduction of the genus of the surface $\mathcal{R}_f$ may also be the result of the
presence of multiple zeros of $V$ and this is a common occurrence, which has
an important meaning. All, except for maybe one, polynomials $V \in \mathcal{V}(e)$
have multiple roots. Loosely speaking, this fact is a reflection of a possible
variety of branch types of the functions $f \in \mathcal{A}_e$. Anyway, the combinatorics of
the set $\mathcal{V}(e)$ is in part determined by multiple roots of $V$.

Suppose that $f$ has a generic branch type, that is, continuation along any
nontrivial loop leads to a different branch. Then the associated $S$-compact $F_f$ is
a continuum; it is, therefore, the Chebotarev continuum for $e$. In a situation
of a “common position” for the configuration of points in the set $e$, the polynomial $V^0$
associated with the Chebotarev continuum will have simple zeros (it may be
viewed as a definition of a “common position”). In such a situation $V^0$ is the
only fixed point of the mapping $\Phi$ with simple zeros. All other polynomials
$V \in \mathcal{V}(e)$ will necessarily have multiple roots and, so, reduced genus of the
associated Riemann surfaces. We do not go into further details. Discussion
of the structure of the set $\mathcal{V}(e)$ is continued in the next section from a different
point of view.

In conclusion of this section we make the following remark. In the case of a
finite set $e$, Stahl’s theorem may be equivalently formulated in terms of
convergence of the remainder $R_n$ in (26) on a Riemann surface (in particular,
this gives an alternative approach to the way of introduction of the Riemann
surface $\mathcal{R} = \mathcal{R}_f$).

The theorem may be stated as follows. For a given $f \in \mathcal{A}_e$ there exists
a hyperelliptic Riemann surface $\mathcal{R}$ such that (with proper normalization) the
sequence $\frac{1}{n} \log|R_n|$ converges in capacity on $\mathcal{R}$ to the $g$-function of $\mathcal{R}$. In
the equivalent form: the sequence of normalized logarithmic derivatives
$\frac{1}{n} \frac{R_n'}{R_n}$ converges to $G'(z)$ in the plane measure on $\mathcal{R}$. Then the surface $\mathcal{R}$ is
uniquely defined by the additional conditions that the projection $F$ of the zero
level of $g$ onto the plane makes $f$ single-valued and also that the jump of $f$ across
any arc from $F$ is not identically zero.

It is possible that in such a form Stahl’s theorem may be directly generalized
to the first kind Hermite-Padé approximants for systems of functions
with branch points.

3.4. Critical measures $\mathcal{M}_e$. For a finite set $e = \{a_1, \dots, a_p\}$ we define
$e$-critical measures as critical points of the energy functional
$\mathcal{E}(\mu) = -\iint \log|x - y|\, d\mu(x)\, d\mu(y)$ with respect to local variations with fixed set $e$.

More exactly, for a smooth complex function $h(z)$ in a neighborhood of
$\operatorname{supp} \mu$ we define point variations $z \to z^t = z + t h(z)$, where $|t| \in (0, \varepsilon)$, and
then variations of the measures $\mu \to \mu^t$ by $d\mu^t(z^t) = d\mu(z)$.

An associated variation of energy (derivative in the direction $h$) is defined
by

$$D_h \mathcal{E}(\mu) = \lim_{t \to 0+} \frac{1}{t} \left( \mathcal{E}(\mu^t) - \mathcal{E}(\mu) \right). \tag{33}$$

Finally, we say that $\mu$ is $e$-critical if for any function $h$ satisfying the condition
$h(a) = 0$ for any $a \in e$, we have $D_h \mathcal{E}(\mu) = 0$. The set of all such measures
is denoted by $\mathcal{M}_e$.

Critical (stationary) measures were first introduced in [29] and then used
in [43]. A systematic study of critical measures (with rational external
fields) was presented in [37] in connection with the zero distribution of
Heine-Stieltjes polynomials; see also the review [49]. Here we use critical measures as
an approach to describe the set of Robin measures of $S$-compact sets $F_f$
associated with a fixed set $e$. Later in Section 4 they are also used to study
stability of convergence in Stahl’s theorem.

It is important to observe, first, that Robin measures of all $S$-compacta
$F_f$ are $e$-critical measures; second, basic properties of Robin measures of
$S$-compacta are preserved for critical measures. In particular, the potential
of any $e$-critical measure $\mu$ has the $S$-property presented by (29) with $U^\mu$
in place of $g$. Next, for any critical measure $\mu$ there exists a polynomial
$V(z) = \prod_{j=1}^{p-2} (z - v_j)$ such that with $A(z) = \prod_{k=1}^{p} (z - a_k)$ we have

$$U^\mu(z) = \operatorname{Re} \int_{a_1}^{z} \sqrt{V(t)/A(t)}\, dt, \qquad d\mu(z) = \frac{1}{\pi} \left| \sqrt{V/A}\, dz \right|. \tag{34}$$

Moreover, $\operatorname{supp} \mu$ is a union of critical trajectories of $-(V(z)/A(z))\,(dz)^2$
and this differential has closed trajectories just as for the Robin measures
of $S$-compact sets $F_f$. Finally, both sets of measures may be characterized
in terms of the associated polynomials $V$.

Using the zeros $v_j$ of $V$ as parameters we represent the set $\mathcal{M}_e$ of critical
measures as a subset in the space of vectors $v = (v_1, \dots, v_{p-2})$ from $\mathbb{C}^{p-2}$.
In these coordinates $\mathcal{M}_e$ is represented as a union of $3^{p-2}$ bounded bordered
domains, which we call cells. Each cell is a bounded bordered manifold of
real dimension $p - 2$. Interior points of each cell correspond to measures $\mu$
whose support $\Gamma = \operatorname{supp} \mu$ consists of exactly $p - 1$ simple disjoint analytic
arcs $\Gamma_j$ with endpoints from the set $\{a_k, v_j\}$. Finally, the $v$-coordinates of
measures $\mu \in \mathcal{M}_e$ are defined by systems of equations

$$\operatorname{Re} \int_{\Gamma_j} \sqrt{V(t)/A(t)}\, dt = 0, \quad j = 1, \dots, p-2; \qquad V(t) = (t - v_1) \cdots (t - v_{p-2}). \tag{35}$$

A particular cell is identified by the homotopic type of the arcs $\Gamma_j$.

Robin measures of $S$-compact sets are among $e$-critical measures and their
representations in terms of $v$-coordinates are located on boundaries of cells.

The space $\mathcal{M}_e$ is connected and each critical measure may be, in a
standard way, connected with the Chebotarev continuum associated with $e$,
which may be defined as the only continuum (closed connected set) in the set
of $S$-compacta for functions $f \in \mathcal{A}_e$. The roots of the polynomial $V_0 \in \mathcal{V}(e)$,
associated with the Chebotarev continuum, effectively play the role of origin
in the $v$-coordinate system and the corresponding “deformation theory” is
in part described in [37].

Further, equilibrium measures of $S$-compact sets satisfy (35) and also $p - 2$
additional equations, which distinguish them among all critical measures.
The potential of any interior critical measure $\mu$, supported on arcs $\Gamma_j$, keeps
a constant value on each of those arcs, that is, we have

$$U^\mu(z) = C_j, \quad z \in \Gamma_j, \qquad \operatorname{supp} \mu = \bigcup_{j=1}^{p-1} \Gamma_j.$$

The collection of constants $C = (C_1, \dots, C_{p-1})$ may be used to parameterize
points in a particular cell in $\mathcal{M}_e$ (note that only $p - 2$ of the constants
are independent). The differences of those constants correspond to
the parametrization of a cell by “heights of cylinders” in terms of the general
moduli problem, which may be associated with critical measures. There is
a dual parametrization by “lengths of circles” which corresponds to the masses
$\mu(\Gamma_j)$ (see [58]).

Now, the additional equations, which determine equilibrium measures of
$S$-compacta in terms of $C$-coordinates, are

$$C_1 = \dots = C_{p-1}.$$

Together with (35) we have a total of $2p - 4$ real equations for the same number
of real parameters in $V$. For some further details see [58], [37], [49].

4. Some generalizations and conjectures 

The method outlined above may be developed in several directions. In 
this section we make a few remarks related to possible generalizations. 

4.1. Dependence of Padé denominators on the function. Let $f \in \mathcal{A}_e$
be a function with a finite set $e$ of branch points defined by an element
(25) at infinity. Let $\pi_n(z) = (P_n/Q_n)(z)$ be the associated Padé approximants
at infinity.

Suppose that we make a small variation of the function $f$ in the class $\mathcal{A}_e$. In
other words, consider a function $\tilde{f} \in \mathcal{A}_e$ which is close to $f$ in some sense;
the location of its branch points is the same, but their character may change.

We want to figure out how much the denominator $Q_n = Q_n(f)$ will
change, say, for a fixed large enough $n$. To be more precise, here we have
in mind a significant change, and as a first step toward investigation of the 
problem, we will discuss a possible change in the limit zero distribution. 
Since the rate of convergence in Stahl’s theorem is determined by the limit 
zero distribution, the problem is essentially about the (rough) stability of 
the convergence in this theorem. 

It turns out that the answer depends on what exactly was the $S$-compact
set $F_f$ for the function $f$. As usual, assume for simplicity that all points
$a_k \in e$, $k = 1, \dots, p$, are actual branch points for $f$.

If $F_f$ was the Chebotarev continuum $F_e$ for $e$ (in other words, the function $f$
has a “generic branch type”) then small variations of $f$ will not produce a
dramatic effect. It is not difficult to see that any such small enough variation
of $f$ will remain of a “generic branch type” and, therefore, will have
the same $S$-compact and the same limit distribution (the Robin measure of
this compact). Thus, the dependence of $Q_n$ on $f$ is essentially continuous
(and asymptotically continuous).

If $F_f$ was any other $S$-compact set, then this dependence is not continuous,
since the dependence of $F_f$ on $f$ is not continuous if $F_f$ was not the Chebotarev
compact for $e$. The branch types which may be obtained by small variations
of $f$ depend on $F_f$. Anyway, it is clear that the generic branch type may be
obtained from any other one using arbitrarily small variations and this is enough
to prove discontinuity of $F_f$ as a function of $f$ at any $f$ whose $S$-compact is not
the Chebotarev one. All facts above are still corollaries of Stahl’s theorem.

The situation changes if we consider a sequence of variations depending
on $n$, which converges to zero as $n \to \infty$. What exactly will happen with the
zero distribution depends on characteristics of the function, characteristics
of the variation and the relation between $n$ and the magnitude of the variation
(it is possible to consider also variations of the locations of branch points,
but the effect will be similar).

Formally, let $f, f_n \in \mathcal{A}_e$ for $n \in \mathbb{N}$ and let the sequence $f_n$ converge to $f$ as
$n \to \infty$. Let $Q_n = Q_n(f_n)$. From what was said above it follows that, if $F_f$
is the Chebotarev continuum for $e$, then the sequence $\frac{1}{n} \mathcal{X}(Q_n)$ converges
weakly to the Robin measure of this continuum.

If $F_f$ is different from the Chebotarev continuum for $e$, the sequence
$\frac{1}{n} \mathcal{X}(Q_n)$ is not generally weakly convergent. We may only claim that the
weak-* limit of any convergent subsequence belongs to the set $\mathcal{M}_e$ of critical
measures for $e$. The measures which are included in the limit set for a given
$f$ depend on $F_f$ and the character of convergence. However, using different
functions $f \in \mathcal{A}_e$ we may obtain any $\mu \in \mathcal{M}_e$ as a limit along the whole
sequence. In other words, any $\mu \in \mathcal{M}_e$ is a weak limit of the whole sequence
$\frac{1}{n} \mathcal{X}(Q_n)$ for some selection of functions $f, f_n \in \mathcal{A}_e$.

To state a theorem formally presenting the assertions above we have to define
the convergence $f_n \to f$. We give a simple example of such a theorem with a
particularly simple kind of convergence. Consider the following model class
of functions

$$\mathcal{L}_e = \left\{ f : f(z) = \prod_{k=1}^{p} (z - a_k)^{\alpha_k}, \quad \sum_{k=1}^{p} \alpha_k = 0 \right\}. \tag{36}$$

We assume that $e = \{a_k\}$ is fixed and $\alpha_k$ are parameters; as usual we assume
that each $a_k$ is an actual branch point of the function ($\alpha_k$ is not an integer).
We have $\mathcal{L}_e \subset \mathcal{A}_e$ and the class $\mathcal{L}_e$ is representative enough in the sense
that all possible branch types are presented by functions from $\mathcal{L}_e$. The
convergence $f_n \to f$ for functions from $\mathcal{L}_e$ is understood as the convergence
$\alpha_{k,n} \to \alpha_k$ of the $\alpha$-parameters of $f_n$ to those of $f$.

Now we can state a version of the theorem related to the class $\mathcal{L}_e$.

Theorem 6. Let the sequence of functions $f_n \in \mathcal{L}_e$ converge to $f \in \mathcal{L}_e$.

If the extremal compact set $F = F_f$ for $f$ is the Chebotarev continuum $F_e$
for $e$, then $\frac{1}{n} \mathcal{X}(Q_n(f_n)) \overset{*}{\to} \lambda$, where $\lambda$ is the Robin measure for $F$.

For any $\mu \in \mathcal{M}_e$ there exists a convergent sequence of functions
$f_n \in \mathcal{L}_e$, $f_n \to f \in \mathcal{L}_e$, such that $\frac{1}{n} \mathcal{X}(Q_n(f_n)) \overset{*}{\to} \mu$ as $n \to \infty$.

Theorem 6 generalizes the Stahl theorem in the same way that Theorem 1
in [29] generalizes the $\rho^2$-theorem. Proofs of both theorems may be based
on the description of the set of critical measures $\mathcal{M}_e$ outlined above and also
on Theorem 1 from [29]. In the next sections we briefly discuss this theorem
and some of its possible generalizations. At the same time Theorem 6 may
be proved in a very simple way using a Laguerre type differential equation for
Padé denominators of functions from $\mathcal{L}_e$.


4.2. Conjectures on zero distribution of complex orthogonal polynomials.
Here we present some conjectures connected to Hermite-Padé
polynomials (their circle of applications may be larger). Thus, we touch the
general problem of generalizing the theory outlined above in this paper
to the case of Hermite-Padé polynomials. This is one of the central
problems in the theory and at the moment the problem is essentially open.

As a starting point we need a version of a general theorem from [29] (GRS 
theorem), which is for the moment, probably, the most advanced known 
theorem related to zero distribution of complex orthogonal polynomials. 


4.2.1. GRS theorem. To state the theorem we need the following definition.

We say that a compact $F \subset \mathbb{C}$ has the $S$-property in an external field $\varphi$
harmonic in a neighborhood of $F$, if the equality in (29) holds for $g = U^\lambda + \varphi$,
the total potential of the equilibrium measure $\lambda = \lambda_{\varphi, F}$ for $F$ in the external field
$\varphi$. The $S$-property implies that $F$ is an at most countable union of disjoint
open analytic arcs $F^0$ and a set of capacity zero (here we assume from the
beginning that an $S$-compact associated with the problem exists).

Now, we state the assumptions of the theorem.

We assume that we are given a domain $\Omega$ in $\mathbb{C}$, a compact set $F$ in $\Omega$
and a sequence of functions $\Phi_n(z) \in H(\Omega)$, which converge, $\Phi_n(z) \to \Phi(z)$,
uniformly on compact subsets of $\Omega$ as $n \to \infty$.

Assume that $F$ has the $S$-property in the external field $\varphi = \operatorname{Re} \Phi(z)$.

Further, let $f \in H(\Omega \setminus F)$ be a function whose jump across any arc
from $F^0$ is not identically zero and let polynomials $Q_n(z) \in \mathbb{P}_n$ be defined by
orthogonality relations with weights $f_n = f e^{-2n\Phi_n}$:

$$\oint_F Q_n(z) P(z) f_n(z)\, dz = 0 \quad \text{for any polynomial } P \in \mathbb{P}_{n-1}. \tag{37}$$

Integration in (37) goes along a contour (or contours) in $\Omega \setminus F$, homotopic to the
boundary of $\overline{\mathbb{C}} \setminus F$.

Finally, assume that the complement to the support of the equilibrium
measure $\lambda = \lambda_{\varphi, F}$ for $F$ in the external field $\varphi$ is connected.

The following is Theorem 1 from [29].

Theorem 7. Under the assumptions above we have $\frac{1}{n} \mathcal{X}(Q_n) \overset{*}{\to} \lambda$.

The orthogonality conditions in (37) are rather general, but in a number of
situations the theorem cannot be applied directly. This often happens in
the study of the zero distribution of Hermite-Padé polynomials. These
polynomials are defined by systems of orthogonality relations, and the
reduction of such systems to orthogonality with respect to a single weight
(when possible) leads to more general forms of orthogonality. Next we give
two comparatively simple examples of different nature.

4.2.2. A conjecture related to Hermite-Padé polynomials for a Nikishin
system. In many cases the study of Nikishin systems may be reduced to a
problem of asymptotics for orthogonal polynomials $Q_n$ defined by relations
similar to (37) in Theorem 7, but with the weight functions $f_n$ in (37)
depending not only on $n$ but also on the polynomial $P$. In other words, the
polynomials $Q_n$ are orthogonal to a collection of functions that are not
pure polynomials, but polynomials $P$ with a multiplier $f_n$ depending not
only on $n$ but also on $P$.

We formulate a conjecture for the case when only $\Phi_n$ depends on $P$.

Let the polynomials $Q_n$ satisfy the orthogonality conditions (37) with
$\Phi_n(z) = \Phi_n(z; P)$. All the assumptions of Theorem 7 above are
preserved. In addition, we assume that for any sequence of polynomials
$P_n \in \mathbb{P}_{n-1}$ such that
$\frac{1}{n} \mathcal{X}(P_n) \overset{*}{\to} \lambda$ we have
$\Phi_n(z; P_n) \to \Phi(z)$.


Conjecture 1. Under the above assumptions we have
$\frac{1}{n} \mathcal{X}(Q_n) \overset{*}{\to} \lambda$.


Conjecture 1 is part of joint work with S. Suetin (in progress). It is
partially suggested by the results of the paper [48], where Hermite-Padé
polynomials of the first kind were considered for a Nikishin system of two
Markov-type functions $f_1, f_2$ on the union $E$ of a finite number of
disjoint real closed intervals $E_j$. We outline the setting of that paper
without going into all the details.

In the paper [48] it was assumed that the ratio of the two jumps
$f(x) := \Delta f_2(x)/\Delta f_1(x)$, $x \in E$, is an analytic
complex-valued function on $E$ and that $f$ has an analytic continuation
from each $E_j$ along any path in $\mathbb{C}$ avoiding the finite set $e_f$
of the branch points of $f$. It was also assumed that the set $e_f$ is
symmetric with respect to the real axis.

First it was proven that (under some additional technical assumptions)
there exists a unique compact set $F$ such that
$f \in H(\mathbb{C} \setminus F)$ and $F$ has the $S$-property with respect to
a related equilibrium problem for a mixed Green-logarithmic potential.

Let $Q_{n,0}, Q_{n,1}, Q_{n,2} \in \mathbb{P}_n$, $Q_{n,2} \not\equiv 0$, be the
Hermite-Padé polynomials of the first kind for the system $[1, f_1, f_2]$,
that is, the following relation holds:

$$(Q_{n,0} \cdot 1 + Q_{n,1} f_1 + Q_{n,2} f_2)(z) = O\left(\frac{1}{z^{2n+2}}\right), \quad z \to \infty. \tag{38}$$
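A simple count of unknowns against linear conditions explains why a
nontrivial triple satisfying (38) always exists. The count below is a
standard linear-algebra argument added for illustration; it assumes only
that $f_1, f_2$ are holomorphic at infinity, so that the left-hand side of
(38) has no powers of $z$ higher than $z^n$.

```latex
% Unknowns: coefficients of Q_{n,0}, Q_{n,1}, Q_{n,2} \in \mathbb{P}_n
\#\{\text{unknowns}\} = 3(n+1) = 3n + 3.
% The left-hand side of (38) has a Laurent expansion \sum_{k \le n} c_k z^{k}
% at infinity; relation (38) requires c_k = 0 for -2n-1 \le k \le n:
\#\{\text{conditions}\} = n - (-2n-1) + 1 = 3n + 2.
% A homogeneous linear system with more unknowns than equations:
3n + 3 > 3n + 2 \;\Longrightarrow\;
\text{a nontrivial solution } (Q_{n,0}, Q_{n,1}, Q_{n,2}) \text{ exists.}
```

The count does not by itself show that $Q_{n,2} \not\equiv 0$; that requires
the additional structure of the system.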


The following orthogonality relation of type (37) for the polynomial
$Q_{n,2}$ was obtained in [48, formula (119)],



hn+m (z') 


Qn(z) 



gn(C)A(C) dm n (C) tf _ 
Pn( C) /( 


dz = 0 (39) 


($P_n \in \mathbb{P}_{n-1}$ is an arbitrary polynomial). Finally, these
orthogonality relations were used to prove that the sequence
$\frac{1}{n} \mathcal{X}(Q_{n,2})$ converges weakly to the equilibrium
measure for the problem mentioned above; for more details see [48].

The connection of this result with Conjecture 1 is established by the
following fact. The function in curly brackets in (39), which plays the
role of a multiplier for $P_n$, satisfies the conditions of Conjecture 1.
Thus, the theorem above supports the conjecture.

4.2.3. A conjecture on incomplete complex orthogonal polynomials.
Hermite-Padé polynomials also lead to another type of asymptotic problem
for orthogonal polynomials. Before discussing this problem (in the last
section below), we introduce an auxiliary problem for complex orthogonal
polynomials, which may, however, be of independent interest. We restrict
ourselves to the simplest possible version of the problem.

Let $f(z) \in \mathcal{A}_e$, where $e = \{a, b\}$. That is, the function $f$
(defined by an element at infinity) has two branch points, at $a$ and at
$b \ne a$. Let $N$ and $n \le N$ be two natural numbers and let the
polynomial $Q_N \in \mathbb{P}_N$ satisfy the relations

$$\int_F Q_N(z) P(z) f(z)\, dz = 0 \quad \text{for any polynomial } P \in \mathbb{P}_{n-1}, \tag{40}$$


where $F$ is a curve connecting $a$ and $b$. Note that here we do not assume
that a special curve is given. By the Cauchy integral theorem, any curve $F$
from the class $\mathcal{F}$ of curves connecting $a$ and $b$ may be used in
(40). Finding a special curve $\Gamma \in \mathcal{F}$ will be a part of the
problem.

Suppose that $n, N \to \infty$ in such a way that $N/n \to k > 1$. What can
be said about the zero distribution of $Q_N$?

Clearly, under these assumptions the polynomial $Q_N$ is not uniquely
defined and we cannot expect the sequence of counting measures
$\frac{1}{n} \mathcal{X}(Q_N)$ to converge. Instead, we suggest that any
limit point of this sequence satisfies a certain inequality. To state this
inequality formally, we first need to select a convergent subsequence

$$\frac{1}{n} \mathcal{X}(Q_N) \overset{*}{\to} \mu \quad \text{as } n \to \infty,\ n \in \Lambda \tag{41}$$

($\Lambda$ is a sequence of natural numbers). Since $N/n \to k$, we have
$\mu(\mathbb{C}) = k > 1$.

The potential $\varphi = U^{\mu}$ of $\mu$ will play the role of the external
field in the problem we are going to consider. We denote by

$$\mathcal{E}_{\varphi}(\nu) = \mathcal{E}(\nu) + 2 \int \varphi\, d\nu$$

the weighted energy of a measure $\nu$ in the external field $\varphi$. Note
that here and in the sequel we use the abbreviated notation:
$\mathcal{E}_{\mu}(\nu)$ stands for $\mathcal{E}_{\varphi}(\nu)$ with
$\varphi = U^{\mu}$ (compare to (17)).

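For a fixed $F \in \mathcal{F}$ we denote by
$\lambda_{F,\mu} \in \mathcal{M}(F)$ the minimizing (equilibrium) measure on
$F$ in the external field $\varphi = U^{\mu}$, and the equilibrium energy by

$$\mathcal{E}_{\mu}(\lambda_{F,\mu}) = \min_{\nu \in \mathcal{M}(F)} \mathcal{E}_{\mu}(\nu),$$

where $\mathcal{M}(F)$ is the set of probability measures on $F$.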

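Next, we introduce the functional of equilibrium energy
$\mathcal{E}_{\mu}[F]$ and assert the existence of a compact set
$\Gamma = \Gamma_{\mu} \in \mathcal{F}$ maximizing this functional (see [49]):

$$\mathcal{E}_{\mu}[\Gamma] = \max_{F \in \mathcal{F}} \mathcal{E}_{\mu}[F], \quad \text{where } \mathcal{E}_{\mu}[F] = \mathcal{E}_{\mu}(\lambda_{F,\mu}). \tag{42}$$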


Finally, we define a mapping $\mu \to \lambda$ in the space of probability
measures in the plane by

$$\lambda(\mu) = \lambda(\mu, \mathcal{F}) = \lambda_{\Gamma,\mu}, \tag{43}$$

where $\Gamma = \Gamma_{\mu}$ is the extremal compact in (42). The conjecture
is formulated in terms of this function.

Conjecture 2. For any subsequential limit $\mu$ of the sequence
$\frac{1}{n} \mathcal{X}(Q_N)$ we have $\mu \ge \lambda(\mu)$.

In a number of situations, Conjecture 2 can be proven. We mention one such
situation, where the proof may be obtained using the GRS method. Suppose
that the limit distribution is known for a part of the zeros consisting of
$N - n$ zeros, and that it is represented by a known measure $\sigma$. In
other words, we assume that $Q_N$ has a factorization
$Q_N = q_n \tilde{q}_n$, where the sequence of polynomials
$\tilde{q}_n \in \mathbb{P}_{N-n}$ has the limit distribution $\sigma$;
formally, $\frac{1}{n} \mathcal{X}(\tilde{q}_n) \overset{*}{\to} \sigma$.

Suppose also that the class $\mathcal{F}$ of continua $F$ connecting $a$ and
$b$ contains a continuum $\Gamma$ with the $S$-property in the external field
$\varphi(z) = \frac{1}{2} U^{\sigma}(z)$. Then the sequence
$\frac{1}{n} \mathcal{X}(q_n)$ converges weakly to
$\lambda = \lambda_{\varphi,\Gamma}$ according to Theorem 7. It follows that
the sequence $\frac{1}{n} \mathcal{X}(Q_N)$ converges to
$\mu = \lambda + \sigma$, so that $\mu \ge \lambda$. Since the equilibrium
measure of an $S$-compact has the max-min property, we have
$\lambda = \lambda(\mu)$ and the assertion of Conjecture 2 follows.
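The choice of external field in this argument can be traced back to the
weight in (37). The computation below is a sketch under assumptions of our
own: the weights in (37) are taken in the form $f_n = f e^{-2n\Phi_n}$, the
factor of degree $N-n$ is written $\tilde{q}_n$ with zeros
$\zeta_1, \dots, \zeta_{N-n}$ (notation introduced here), the potential is
normalized as $U^{\sigma}(z) = \int \log\frac{1}{|z-t|}\, d\sigma(t)$, and we
work in a domain $\Omega$ containing no $\zeta_j$ where a single-valued
branch of $\log \tilde{q}_n$ exists.

```latex
% Substituting Q_N = q_n \tilde q_n into (40):
\int_F q_n(z)\, P(z)\, \bigl[\tilde q_n(z) f(z)\bigr]\, dz = 0,
\qquad P \in \mathbb{P}_{n-1}.
% Matching with the weight of (37):
\tilde q_n f = f e^{-2n\Phi_n}
\quad\Longleftrightarrow\quad
\Phi_n(z) = -\frac{1}{2n} \log \tilde q_n(z).
% Real part, for monic \tilde q_n(z) = \prod_j (z - \zeta_j):
\operatorname{Re} \Phi_n(z)
  = \frac{1}{2n} \sum_{j=1}^{N-n} \log \frac{1}{|z - \zeta_j|}
  = \frac{1}{2}\, U^{\sigma_n}(z),
\qquad \sigma_n = \frac{1}{n}\, \mathcal{X}(\tilde q_n).
% Passing to the limit \sigma_n \overset{*}{\to} \sigma away from the zeros:
\varphi(z) = \operatorname{Re} \Phi(z) = \frac{1}{2}\, U^{\sigma}(z).
```

Under these assumptions $q_n$ plays the role of $Q_n$ in Theorem 7, which is
how the factor $\frac{1}{2}$ in the external field arises.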

Thus, in this situation the max-min definition of $\lambda(\mu)$ can be
equivalently formulated in terms of the $S$-property. In general, we have to
define $\lambda(\mu)$ in terms of "max-min", since the external fields
associated with the problem may not be harmonic (or even smooth) around the
extremal compact.

4.2.4. A conjecture related to Hermite-Padé polynomials for an Angelesco
system. As an example of a possible application of Conjecture 2 we mention
the problem of the zero distribution for denominators of Hermite-Padé
approximants of the second kind in the Angelesco case.

The simplest setting is as follows. Let $e_1 = \{a_1, b_1\}$ and
$e_2 = \{a_2, b_2\}$ be two given sets, where $a_i \ne b_i$ for $i = 1, 2$.
Two functions $f_1 \in \mathcal{A}_{e_1}$ and $f_2 \in \mathcal{A}_{e_2}$ are
defined by their Laurent series at infinity. Assume that $\{a_i, b_i\}$ are
actual branch points of $f_i$. Finally, a nontrivial polynomial
$Q = Q_{2n} \in \mathbb{P}_{2n}$ is defined by the pair of conditions

$$(Q f_1 - P_1)(z) = O\left(\frac{1}{z^{n+1}}\right), \quad (Q f_2 - P_2)(z) = O\left(\frac{1}{z^{n+1}}\right)$$

as $z \to \infty$, where $P_i$ is the polynomial part of $Q f_i$ at infinity
($i = 1, 2$).

We assume that the pair of functions $f_1, f_2$ (or, rather, the pair of sets
$e_1, e_2$) presents the "Angelesco case", which informally means that $e_1$
and $e_2$ are "well separated" (far enough from each other). The formal
definition is given below, after the related definitions are introduced. As
an example, we note that if all branch points are real, then the Angelesco
case is defined by the condition that the intervals $(a_1, b_1)$ and
$(a_2, b_2)$ are disjoint. It is known that in this case the limit zero
distribution of the sequence $Q_{2n}$ is defined by a matrix equilibrium
problem on the pair of conductors $F_1 = [a_1, b_1]$ and $F_2 = [a_2, b_2]$
(see [20] and [30] for the Markov case). For the complex case we have to use
a matrix $S$-equilibrium problem, which is defined below.

For $i = 1, 2$ denote by $\mathcal{F}_i$ the class of continua in the plane
connecting the points $a_i$ and $b_i$. We consider the class
$\vec{\mathcal{F}}$ of vector compacts $\vec{F} = (F_1, F_2)$, where
$F_i \in \mathcal{F}_i$. For a fixed vector-compact set
$\vec{F} = (F_1, F_2) \in \vec{\mathcal{F}}$ we define the class of vector
measures

$$\vec{\mathcal{M}} = \{(\mu_1, \mu_2) : \mu_j \in \mathcal{M}(F_j)\},$$

where $\mathcal{M}(F_j)$ is the set of probability measures on $F_j$. The
energy of the vector measure $\vec{\mu} = (\mu_1, \mu_2)$ is defined by

$$\mathcal{E}(\vec{\mu}) = [\mu_1, \mu_1] + [\mu_1, \mu_2] + [\mu_2, \mu_2],$$

where $[\mu, \nu] = \int U^{\nu}\, d\mu$ is the mutual energy of $\mu$ and
$\nu$. In a more general situation, the energy of a vector measure is
defined by a matrix $A$ with constant elements $a_{ij}$, $i, j = 1, 2$, so
that the matrix energy is
$\mathcal{E}(\vec{\mu}) = \sum_{i,j} a_{ij} [\mu_i, \mu_j]$. In our case the
elements of the matrix $A$ are $a_{11} = a_{22} = 1$ and
$a_{12} = a_{21} = 1/2$. This matrix is positive definite and, moreover,
$a_{ij} \ge 0$. It follows that for any
$\vec{F} = (F_1, F_2) \in \vec{\mathcal{F}}$ there exists a unique
$\vec{\lambda} \in \vec{\mathcal{M}}$ such that

$$\mathcal{E}[\vec{F}] = \mathcal{E}(\vec{\lambda}) = \min_{\vec{\mu} \in \vec{\mathcal{M}}} \mathcal{E}(\vec{\mu}), \quad \vec{\lambda} = (\lambda_1, \lambda_2).$$

The vector measure $\vec{\lambda}$ is the equilibrium measure for $\vec{F}$
associated with the matrix $A$; $\mathcal{E}[\vec{F}]$ is the equilibrium
energy of $\vec{F}$ (see the original papers [20], [27], [30] and recent
developments in [33], [34]).
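The two claims about the interaction matrix, positive definiteness and
agreement with the explicit three-term energy, can be verified directly.
The short check below is added for illustration and uses only the entries
$a_{11} = a_{22} = 1$, $a_{12} = a_{21} = 1/2$ and the symmetry of the mutual
energy.

```latex
A = \begin{pmatrix} 1 & \tfrac12 \\ \tfrac12 & 1 \end{pmatrix}, \qquad
a_{11} = 1 > 0, \qquad \det A = 1 - \tfrac14 = \tfrac34 > 0,
% so A is positive definite by Sylvester's criterion; its eigenvalues are
\det(A - tI) = (1-t)^2 - \tfrac14 = 0
\;\Longrightarrow\; t = \tfrac12,\ \tfrac32.
% Consistency with the explicit energy, using [\mu_1,\mu_2] = [\mu_2,\mu_1]:
\sum_{i,j=1}^{2} a_{ij} [\mu_i, \mu_j]
  = [\mu_1,\mu_1] + \tfrac12 [\mu_1,\mu_2] + \tfrac12 [\mu_2,\mu_1] + [\mu_2,\mu_2]
  = [\mu_1,\mu_1] + [\mu_1,\mu_2] + [\mu_2,\mu_2].
```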

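Further, there exists a vector-compact set
$\vec{\Gamma} = (\Gamma_1, \Gamma_2) \in \vec{\mathcal{F}}$ maximizing the
equilibrium energy:

$$\mathcal{E}[\vec{\Gamma}] = \max_{\vec{F} \in \vec{\mathcal{F}}} \mathcal{E}[\vec{F}].$$

The existence of a maximizing vector-compact set $\vec{\Gamma}$ may be proved
by the methods presented in [49]. In general, it is not unique, but the
associated equilibrium measure $\vec{\lambda} = (\lambda_1, \lambda_2)$ is
unique.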

What can be asserted about the limit zero distribution of the Hermite-Padé
denominators $Q_{2n}$ depends essentially on the structure of $\vec{\Gamma}$
or, rather, on the structure of $\vec{\lambda}$. If the supports of
$\lambda_1$ and $\lambda_2$ overlap essentially, then the vector measure
$\vec{\lambda}$ does not describe the zero distribution of the polynomials
$Q_{2n}$ and the case under consideration is not an Angelesco case. In such
a situation the equilibrium problem has to be modified; we refer to the
papers [2] and [3] for further details.

If the supports of $\lambda_1$ and $\lambda_2$ do not intersect, then we have
the Angelesco case, and we assume this condition in what follows (the case
of a small, say finite, intersection may be included, but we restrict our
considerations to the disjoint situation). Now, the main hypothesis on the
zero distribution of Angelesco Hermite-Padé polynomials is stated as
follows.

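Conjecture 3. We have
$\frac{1}{n} \mathcal{X}(Q_{2n}) \overset{*}{\to} \lambda_1 + \lambda_2$,
where $\vec{\lambda} = (\lambda_1, \lambda_2)$ is the equilibrium measure of
the extremal compact $\vec{\Gamma} = (\Gamma_1, \Gamma_2)$.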

In a number of cases the conjecture has been proved under some additional
restrictions. First, if the sets $e_1$ and $e_2$ are far enough apart, the
proof may be given on the basis of the GRS method. This approach is simple
and general: the number of functions and the number and character of the
branch points are actually not important if the sets of branch points are
well separated. However, it is difficult to obtain sharp estimates of the
critical distance between the sets.

Second, under some additional assumptions on the character of the branch
points, strong asymptotics for $Q_{2n}$ have been derived in [3] for two
functions, each with two algebraic-logarithmic branch points. The proof
uses the steepest descent method for the matrix Riemann-Hilbert
representation of $Q_{2n}$. This method is sensitive to the number of
functions and the number of their branch points. It is not clear whether it
may be generalized to arbitrary branch points.

Now, a proof for an arbitrary Angelesco situation may be reduced to
Conjecture 2 (or its proper generalization to more than two functions with
any number of branch points). Such a reduction would require some
additional potential-theoretic considerations.

We mention the shortest way of reduction, which is based on one more
conjecture.

Let $\vec{\lambda} = (\lambda_1, \lambda_2)$ be the equilibrium measure of the
extremal compact set $\vec{\Gamma} = (\Gamma_1, \Gamma_2)$. We define
$\mu = \lambda_1 + \lambda_2$ as in Conjecture 3. Assuming that the
Angelesco case is in effect, the extremal vector-compact set $\vec{\Gamma}$
has the following important property: both components $\Gamma_1$ and
$\Gamma_2$ have the $S$-property in the external field
$\varphi(z) = U^{\mu}(z)$. Further, the $S$-property may be rewritten as the
"energy max-min property" and, therefore, the three measures above satisfy
the following relation:

$$\mu \ge \lambda_1 = \lambda(\mu, \mathcal{F}_1), \quad \mu \ge \lambda_2 = \lambda(\mu, \mathcal{F}_2), \tag{44}$$

where both measures $\lambda(\mu, \mathcal{F}_1)$ and
$\lambda(\mu, \mathcal{F}_2)$ are defined in (43) above and
$\mathcal{F}_i = \mathcal{F}(f_i)$ are the classes of admissible cuts for the
functions $f_1, f_2$.

Of course, we actually have equality in (44), but we need inequalities to
make the converse assertion stronger.

Conjecture 4. In the Angelesco case, the measure
$\mu = \lambda_1 + \lambda_2$ is the only positive Borel measure in the
plane satisfying (44) with $\mu(\mathbb{C}) = 2$.

If both Conjectures 2 and 4 are true, then Conjecture 3 is also true, since
it is a direct corollary of these two conjectures.
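The logical chain behind this implication can be spelled out as follows.
This is only a sketch of our reading of the reduction. It assumes that
Conjecture 2 applies to each of $f_1$ and $f_2$ with $N = 2n$, that the
contour integrals below (taken over a contour enclosing the cut for $f_i$)
encode the conditions defining $Q_{2n}$, and that no mass of the zero
counting measures escapes to infinity.

```latex
% Step 1: the conditions (Q f_i - P_i)(z) = O(z^{-n-1}) say that the
% coefficients of z^{-1}, ..., z^{-n} of Q_{2n} f_i vanish, i.e.
\oint Q_{2n}(z)\, z^{k} f_i(z)\, dz = 0,
\qquad k = 0, \dots, n-1, \quad i = 1, 2,
% which are relations of type (40) with N = 2n, so N/n = 2.
% Step 2: let \mu be any subsequential limit of \tfrac1n \mathcal{X}(Q_{2n}),
% so that \mu(\mathbb{C}) = 2. Conjecture 2, applied to f_1 and to f_2, gives
\mu \ge \lambda(\mu, \mathcal{F}_1), \qquad \mu \ge \lambda(\mu, \mathcal{F}_2).
% Step 3: Conjecture 4 (uniqueness) then forces
\mu = \lambda_1 + \lambda_2,
% and since every subsequential limit coincides, the whole sequence converges.
```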

The author thanks the referee for the valuable remarks directed towards 
improving the style of the paper and also for indicating instances where 
additional explanations are desirable. 